Overview
Brought to you by YData
Dataset statistics
| Number of variables | 139 |
|---|---|
| Number of observations | 455212 |
| Missing cells | 33080502 |
| Missing cells (%) | 52.3% |
| Total size in memory | 482.7 MiB |
| Average record size in memory | 1.1 KiB |
Variable types
| Text | 139 |
|---|
Dataset
| Description | Fish NMNH Extant Specimen Records 0055081-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.34mb2x |
license has constant value "CC0_1_0" | Constant |
publisher has constant value "National Museum of Natural History, Smithsonian Institution" | Constant |
institutionID has constant value "urn:lsid:biocol.org:col:34871" | Constant |
collectionID has constant value "urn:uuid:09c9cf5f-f5d3-48cc-b5c8-cd9b9fbd631f" | Constant |
institutionCode has constant value "USNM" | Constant |
collectionCode has constant value "FISH" | Constant |
datasetName has constant value "NMNH Extant Biology" | Constant |
sex has constant value "MALE" | Constant |
eventID has constant value "941.0" | Constant |
minimumDistanceAboveSurfaceInMeters has constant value "Williams, Jeffrey T." | Constant |
earliestEraOrLowestErathem has constant value "Animalia" | Constant |
latestEraOrHighestErathem has constant value "Chordata" | Constant |
verbatimIdentification has constant value "SPECIES" | Constant |
identifiedByID has constant value "ACCEPTED" | Constant |
identificationVerificationStatus has constant value "821cc27a-e3bb-4bc5-ac34-89ada245069d" | Constant |
identificationRemarks has constant value "US" | Constant |
taxonConceptID has constant value "StillImage" | Constant |
acceptedNameUsage has constant value "false" | Constant |
nameAccordingTo has constant value "1" | Constant |
namePublishedIn has constant value "44" | Constant |
subtribe has constant value "EML" | Constant |
nomenclaturalStatus has constant value "PHL.36.21_1" | Constant |
taxonRemarks has constant value "Iloilo City" | Constant |
protocol has constant value "EML" | Constant |
lastCrawled has constant value "2024-12-02T11:48:23.416Z" | Constant |
publishedByGbifRegion has constant value "NORTH_AMERICA" | Constant |
recordNumber has 434386 (95.4%) missing values | Missing |
recordedBy has 287312 (63.1%) missing values | Missing |
sex has 455209 (> 99.9%) missing values | Missing |
preparations has 346184 (76.0%) missing values | Missing |
associatedSequences has 454762 (99.9%) missing values | Missing |
occurrenceRemarks has 290485 (63.8%) missing values | Missing |
verbatimLabel has 455209 (> 99.9%) missing values | Missing |
materialSampleID has 455209 (> 99.9%) missing values | Missing |
eventID has 455211 (> 99.9%) missing values | Missing |
fieldNumber has 274211 (60.2%) missing values | Missing |
eventDate has 60241 (13.2%) missing values | Missing |
startDayOfYear has 91500 (20.1%) missing values | Missing |
endDayOfYear has 91500 (20.1%) missing values | Missing |
year has 60500 (13.3%) missing values | Missing |
month has 82757 (18.2%) missing values | Missing |
day has 108703 (23.9%) missing values | Missing |
verbatimEventDate has 92472 (20.3%) missing values | Missing |
locationID has 352012 (77.3%) missing values | Missing |
higherGeography has 20492 (4.5%) missing values | Missing |
continent has 162647 (35.7%) missing values | Missing |
waterBody has 133275 (29.3%) missing values | Missing |
islandGroup has 390811 (85.9%) missing values | Missing |
island has 270596 (59.4%) missing values | Missing |
countryCode has 30434 (6.7%) missing values | Missing |
stateProvince has 174301 (38.3%) missing values | Missing |
county has 357533 (78.5%) missing values | Missing |
locality has 45084 (9.9%) missing values | Missing |
verbatimElevation has 453008 (99.5%) missing values | Missing |
verbatimDepth has 446636 (98.1%) missing values | Missing |
minimumDistanceAboveSurfaceInMeters has 455211 (> 99.9%) missing values | Missing |
decimalLatitude has 254257 (55.9%) missing values | Missing |
decimalLongitude has 254257 (55.9%) missing values | Missing |
coordinateUncertaintyInMeters has 450059 (98.9%) missing values | Missing |
pointRadiusSpatialFit has 455205 (> 99.9%) missing values | Missing |
verbatimCoordinateSystem has 308939 (67.9%) missing values | Missing |
georeferencedBy has 455205 (> 99.9%) missing values | Missing |
georeferenceProtocol has 437832 (96.2%) missing values | Missing |
georeferenceRemarks has 432197 (94.9%) missing values | Missing |
latestEonOrHighestEonothem has 455205 (> 99.9%) missing values | Missing |
earliestEraOrLowestErathem has 455205 (> 99.9%) missing values | Missing |
latestEraOrHighestErathem has 455205 (> 99.9%) missing values | Missing |
latestPeriodOrHighestSystem has 455205 (> 99.9%) missing values | Missing |
latestEpochOrHighestSeries has 455205 (> 99.9%) missing values | Missing |
highestBiostratigraphicZone has 455205 (> 99.9%) missing values | Missing |
lithostratigraphicTerms has 455205 (> 99.9%) missing values | Missing |
member has 455205 (> 99.9%) missing values | Missing |
verbatimIdentification has 455205 (> 99.9%) missing values | Missing |
identificationQualifier has 453516 (99.6%) missing values | Missing |
typeStatus has 436448 (95.9%) missing values | Missing |
identifiedBy has 421073 (92.5%) missing values | Missing |
identifiedByID has 455205 (> 99.9%) missing values | Missing |
identificationVerificationStatus has 455205 (> 99.9%) missing values | Missing |
identificationRemarks has 455205 (> 99.9%) missing values | Missing |
taxonID has 455205 (> 99.9%) missing values | Missing |
parentNameUsageID has 455209 (> 99.9%) missing values | Missing |
originalNameUsageID has 455209 (> 99.9%) missing values | Missing |
namePublishedInID has 455205 (> 99.9%) missing values | Missing |
taxonConceptID has 455210 (> 99.9%) missing values | Missing |
acceptedNameUsage has 455205 (> 99.9%) missing values | Missing |
parentNameUsage has 455205 (> 99.9%) missing values | Missing |
originalNameUsage has 455205 (> 99.9%) missing values | Missing |
nameAccordingTo has 455205 (> 99.9%) missing values | Missing |
namePublishedIn has 455205 (> 99.9%) missing values | Missing |
class has 444746 (97.7%) missing values | Missing |
superfamily has 455205 (> 99.9%) missing values | Missing |
subfamily has 455205 (> 99.9%) missing values | Missing |
subtribe has 455205 (> 99.9%) missing values | Missing |
genus has 23586 (5.2%) missing values | Missing |
genericName has 23579 (5.2%) missing values | Missing |
subgenus has 455206 (> 99.9%) missing values | Missing |
specificEpithet has 70259 (15.4%) missing values | Missing |
infraspecificEpithet has 447018 (98.2%) missing values | Missing |
cultivarEpithet has 455206 (> 99.9%) missing values | Missing |
verbatimTaxonRank has 455210 (> 99.9%) missing values | Missing |
vernacularName has 455210 (> 99.9%) missing values | Missing |
nomenclaturalCode has 455210 (> 99.9%) missing values | Missing |
nomenclaturalStatus has 455211 (> 99.9%) missing values | Missing |
taxonRemarks has 455211 (> 99.9%) missing values | Missing |
depth has 246174 (54.1%) missing values | Missing |
depthAccuracy has 266866 (58.6%) missing values | Missing |
distanceFromCentroidInMeters has 454306 (99.8%) missing values | Missing |
mediaType has 363819 (79.9%) missing values | Missing |
classKey has 444746 (97.7%) missing values | Missing |
genusKey has 23593 (5.2%) missing values | Missing |
speciesKey has 70260 (15.4%) missing values | Missing |
species has 70260 (15.4%) missing values | Missing |
repatriated has 30397 (6.7%) missing values | Missing |
gbifRegion has 32195 (7.1%) missing values | Missing |
level0Gid has 407295 (89.5%) missing values | Missing |
level0Name has 407295 (89.5%) missing values | Missing |
level1Gid has 408402 (89.7%) missing values | Missing |
level1Name has 408402 (89.7%) missing values | Missing |
level2Gid has 412023 (90.5%) missing values | Missing |
level2Name has 412026 (90.5%) missing values | Missing |
level3Gid has 441377 (97.0%) missing values | Missing |
level3Name has 441442 (97.0%) missing values | Missing |
iucnRedListCategory has 11501 (2.5%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
Reproduction
| Analysis started | 2025-01-08 22:56:37.908163 |
|---|---|
| Analysis finished | 2025-01-08 22:57:00.805949 |
| Duration | 22.9 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 455212 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 455212 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1317202656 |
|---|---|
| 2nd row | 1317202715 |
| 3rd row | 1322535976 |
| 4th row | 1317203467 |
| 5th row | 2235732924 |
| Value | Count | Frequency (%) |
| 1317202656 | 1 | < 0.1% |
| 1322550703 | 1 | < 0.1% |
| 1317206835 | 1 | < 0.1% |
| 1322539466 | 1 | < 0.1% |
| 2235733055 | 1 | < 0.1% |
| 1322541352 | 1 | < 0.1% |
| 1843575433 | 1 | < 0.1% |
| 1843575436 | 1 | < 0.1% |
| 1322545228 | 1 | < 0.1% |
| 3467167330 | 1 | < 0.1% |
| Other values (455202) | 455202 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 954765 | |
| 3 | 714943 | |
| 2 | 571998 | |
| 8 | 367591 | 8.1% |
| 0 | 348573 | 7.7% |
| 9 | 346657 | 7.6% |
| 7 | 345233 | 7.6% |
| 4 | 314224 | 6.9% |
| 5 | 302152 | 6.6% |
| 6 | 285984 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4552120 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 954765 | |
| 3 | 714943 | |
| 2 | 571998 | |
| 8 | 367591 | 8.1% |
| 0 | 348573 | 7.7% |
| 9 | 346657 | 7.6% |
| 7 | 345233 | 7.6% |
| 4 | 314224 | 6.9% |
| 5 | 302152 | 6.6% |
| 6 | 285984 | 6.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4552120 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 954765 | |
| 3 | 714943 | |
| 2 | 571998 | |
| 8 | 367591 | 8.1% |
| 0 | 348573 | 7.7% |
| 9 | 346657 | 7.6% |
| 7 | 345233 | 7.6% |
| 4 | 314224 | 6.9% |
| 5 | 302152 | 6.6% |
| 6 | 285984 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4552120 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 954765 | |
| 3 | 714943 | |
| 2 | 571998 | |
| 8 | 367591 | 8.1% |
| 0 | 348573 | 7.7% |
| 9 | 346657 | 7.6% |
| 7 | 345233 | 7.6% |
| 4 | 314224 | 6.9% |
| 5 | 302152 | 6.6% |
| 6 | 285984 | 6.3% |
license
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC0_1_0 |
|---|---|
| 2nd row | CC0_1_0 |
| 3rd row | CC0_1_0 |
| 4th row | CC0_1_0 |
| 5th row | CC0_1_0 |
| Value | Count | Frequency (%) |
| cc0_1_0 | 455212 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 910424 | |
| 0 | 910424 | |
| _ | 910424 | |
| 1 | 455212 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1365636 | |
| Uppercase Letter | 910424 | |
| Connector Punctuation | 910424 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 910424 | |
| 1 | 455212 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 910424 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 910424 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2276060 | |
| Latin | 910424 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 910424 | |
| _ | 910424 | |
| 1 | 455212 |
Latin
| Value | Count | Frequency (%) |
| C | 910424 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3186484 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 910424 | |
| 0 | 910424 | |
| _ | 910424 | |
| 1 | 455212 |
modified
Text
| Distinct | 55507 |
|---|---|
| Distinct (%) | 12.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 31272 ? |
|---|---|
| Unique (%) | 6.9% |
Sample
| 1st row | 2023-06-02T12:34:00Z |
|---|---|
| 2nd row | 2019-11-27T11:21:00Z |
| 3rd row | 2018-02-21T11:18:00Z |
| 4th row | 2020-03-23T11:52:00Z |
| 5th row | 2019-07-18T12:15:00Z |
| Value | Count | Frequency (%) |
| 2022-09-13t10:13:00z | 2762 | 0.6% |
| 2015-04-16t13:10:00z | 2261 | 0.5% |
| 2018-07-27t10:48:00z | 2063 | 0.5% |
| 2017-12-01t13:03:00z | 2039 | 0.4% |
| 2017-08-29t08:37:00z | 1935 | 0.4% |
| 2018-07-27t10:44:00z | 1898 | 0.4% |
| 2019-07-18t12:17:00z | 1876 | 0.4% |
| 2017-12-18t13:20:00z | 1847 | 0.4% |
| 2017-12-04t11:22:00z | 1814 | 0.4% |
| 2019-07-18t12:15:00z | 1731 | 0.4% |
| Other values (55497) | 434986 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2223228 | |
| 1 | 1214237 | |
| 2 | 1142482 | |
| - | 910424 | |
| : | 910424 | |
| T | 455212 | 5.0% |
| Z | 455212 | 5.0% |
| 4 | 380598 | 4.2% |
| 8 | 314110 | 3.5% |
| 3 | 305429 | 3.4% |
| Other values (4) | 792884 | 8.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6372968 | |
| Dash Punctuation | 910424 | 10.0% |
| Other Punctuation | 910424 | 10.0% |
| Uppercase Letter | 910424 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2223228 | |
| 1 | 1214237 | |
| 2 | 1142482 | |
| 4 | 380598 | 6.0% |
| 8 | 314110 | 4.9% |
| 3 | 305429 | 4.8% |
| 5 | 267556 | 4.2% |
| 7 | 199859 | 3.1% |
| 9 | 176646 | 2.8% |
| 6 | 148823 | 2.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 455212 | |
| Z | 455212 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 910424 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 910424 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8193816 | |
| Latin | 910424 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2223228 | |
| 1 | 1214237 | |
| 2 | 1142482 | |
| - | 910424 | |
| : | 910424 | |
| 4 | 380598 | 4.6% |
| 8 | 314110 | 3.8% |
| 3 | 305429 | 3.7% |
| 5 | 267556 | 3.3% |
| 7 | 199859 | 2.4% |
| Other values (2) | 325469 | 4.0% |
Latin
| Value | Count | Frequency (%) |
| T | 455212 | |
| Z | 455212 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9104240 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2223228 | |
| 1 | 1214237 | |
| 2 | 1142482 | |
| - | 910424 | |
| : | 910424 | |
| T | 455212 | 5.0% |
| Z | 455212 | 5.0% |
| 4 | 380598 | 4.2% |
| 8 | 314110 | 3.5% |
| 3 | 305429 | 3.4% |
| Other values (4) | 792884 | 8.7% |
publisher
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 59 |
| Mean length | 59 |
| Min length | 59 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | National Museum of Natural History, Smithsonian Institution |
|---|---|
| 2nd row | National Museum of Natural History, Smithsonian Institution |
| 3rd row | National Museum of Natural History, Smithsonian Institution |
| 4th row | National Museum of Natural History, Smithsonian Institution |
| 5th row | National Museum of Natural History, Smithsonian Institution |
| Value | Count | Frequency (%) |
| national | 455212 | |
| museum | 455212 | |
| of | 455212 | |
| natural | 455212 | |
| history | 455212 | |
| smithsonian | 455212 | |
| institution | 455212 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 3186484 | |
| i | 2731272 | |
| 2731272 | ||
| a | 2276060 | 8.5% |
| o | 2276060 | 8.5% |
| n | 2276060 | 8.5% |
| s | 1820848 | 6.8% |
| u | 1820848 | 6.8% |
| r | 910424 | 3.4% |
| m | 910424 | 3.4% |
| Other values (11) | 5917756 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20939752 | |
| Space Separator | 2731272 | 10.2% |
| Uppercase Letter | 2731272 | 10.2% |
| Other Punctuation | 455212 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 3186484 | |
| i | 2731272 | |
| a | 2276060 | |
| o | 2276060 | |
| n | 2276060 | |
| s | 1820848 | |
| u | 1820848 | |
| r | 910424 | 4.3% |
| m | 910424 | 4.3% |
| l | 910424 | 4.3% |
| Other values (4) | 1820848 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 910424 | |
| M | 455212 | |
| H | 455212 | |
| S | 455212 | |
| I | 455212 |
Space Separator
| Value | Count | Frequency (%) |
| 2731272 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 455212 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23671024 | |
| Common | 3186484 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 3186484 | |
| i | 2731272 | |
| a | 2276060 | |
| o | 2276060 | |
| n | 2276060 | |
| s | 1820848 | 7.7% |
| u | 1820848 | 7.7% |
| r | 910424 | 3.8% |
| m | 910424 | 3.8% |
| N | 910424 | 3.8% |
| Other values (9) | 4552120 |
Common
| Value | Count | Frequency (%) |
| 2731272 | ||
| , | 455212 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26857508 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 3186484 | |
| i | 2731272 | |
| 2731272 | ||
| a | 2276060 | 8.5% |
| o | 2276060 | 8.5% |
| n | 2276060 | 8.5% |
| s | 1820848 | 6.8% |
| u | 1820848 | 6.8% |
| r | 910424 | 3.4% |
| m | 910424 | 3.4% |
| Other values (11) | 5917756 |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 29 |
| Min length | 29 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:lsid:biocol.org:col:34871 |
|---|---|
| 2nd row | urn:lsid:biocol.org:col:34871 |
| 3rd row | urn:lsid:biocol.org:col:34871 |
| 4th row | urn:lsid:biocol.org:col:34871 |
| 5th row | urn:lsid:biocol.org:col:34871 |
| Value | Count | Frequency (%) |
| urn:lsid:biocol.org:col:34871 | 455212 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1820848 | |
| : | 1820848 | |
| l | 1365636 | 10.3% |
| i | 910424 | 6.9% |
| r | 910424 | 6.9% |
| c | 910424 | 6.9% |
| g | 455212 | 3.4% |
| 7 | 455212 | 3.4% |
| 8 | 455212 | 3.4% |
| 4 | 455212 | 3.4% |
| Other values (8) | 3641696 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8649028 | |
| Other Punctuation | 2276060 | 17.2% |
| Decimal Number | 2276060 | 17.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1820848 | |
| l | 1365636 | |
| i | 910424 | |
| r | 910424 | |
| c | 910424 | |
| g | 455212 | 5.3% |
| u | 455212 | 5.3% |
| b | 455212 | 5.3% |
| d | 455212 | 5.3% |
| s | 455212 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 455212 | |
| 8 | 455212 | |
| 4 | 455212 | |
| 3 | 455212 | |
| 1 | 455212 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1820848 | |
| . | 455212 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8649028 | |
| Common | 4552120 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 1820848 | |
| l | 1365636 | |
| i | 910424 | |
| r | 910424 | |
| c | 910424 | |
| g | 455212 | 5.3% |
| u | 455212 | 5.3% |
| b | 455212 | 5.3% |
| d | 455212 | 5.3% |
| s | 455212 | 5.3% |
Common
| Value | Count | Frequency (%) |
| : | 1820848 | |
| 7 | 455212 | 10.0% |
| 8 | 455212 | 10.0% |
| 4 | 455212 | 10.0% |
| 3 | 455212 | 10.0% |
| . | 455212 | 10.0% |
| 1 | 455212 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13201148 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 1820848 | |
| : | 1820848 | |
| l | 1365636 | 10.3% |
| i | 910424 | 6.9% |
| r | 910424 | 6.9% |
| c | 910424 | 6.9% |
| g | 455212 | 3.4% |
| 7 | 455212 | 3.4% |
| 8 | 455212 | 3.4% |
| 4 | 455212 | 3.4% |
| Other values (8) | 3641696 |
collectionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:09c9cf5f-f5d3-48cc-b5c8-cd9b9fbd631f |
|---|---|
| 2nd row | urn:uuid:09c9cf5f-f5d3-48cc-b5c8-cd9b9fbd631f |
| 3rd row | urn:uuid:09c9cf5f-f5d3-48cc-b5c8-cd9b9fbd631f |
| 4th row | urn:uuid:09c9cf5f-f5d3-48cc-b5c8-cd9b9fbd631f |
| 5th row | urn:uuid:09c9cf5f-f5d3-48cc-b5c8-cd9b9fbd631f |
| Value | Count | Frequency (%) |
| urn:uuid:09c9cf5f-f5d3-48cc-b5c8-cd9b9fbd631f | 455212 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 2731272 | |
| f | 2276060 | |
| 9 | 1820848 | |
| - | 1820848 | |
| d | 1820848 | |
| b | 1365636 | 6.7% |
| 5 | 1365636 | 6.7% |
| u | 1365636 | 6.7% |
| : | 910424 | 4.4% |
| 3 | 910424 | 4.4% |
| Other values (8) | 4096908 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10925088 | |
| Decimal Number | 6828180 | |
| Dash Punctuation | 1820848 | 8.9% |
| Other Punctuation | 910424 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 2731272 | |
| f | 2276060 | |
| d | 1820848 | |
| b | 1365636 | |
| u | 1365636 | |
| r | 455212 | 4.2% |
| i | 455212 | 4.2% |
| n | 455212 | 4.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 1820848 | |
| 5 | 1365636 | |
| 3 | 910424 | |
| 8 | 910424 | |
| 0 | 455212 | 6.7% |
| 4 | 455212 | 6.7% |
| 6 | 455212 | 6.7% |
| 1 | 455212 | 6.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1820848 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 910424 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10925088 | |
| Common | 9559452 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 1820848 | |
| - | 1820848 | |
| 5 | 1365636 | |
| : | 910424 | |
| 3 | 910424 | |
| 8 | 910424 | |
| 0 | 455212 | 4.8% |
| 4 | 455212 | 4.8% |
| 6 | 455212 | 4.8% |
| 1 | 455212 | 4.8% |
Latin
| Value | Count | Frequency (%) |
| c | 2731272 | |
| f | 2276060 | |
| d | 1820848 | |
| b | 1365636 | |
| u | 1365636 | |
| r | 455212 | 4.2% |
| i | 455212 | 4.2% |
| n | 455212 | 4.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20484540 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 2731272 | |
| f | 2276060 | |
| 9 | 1820848 | |
| - | 1820848 | |
| d | 1820848 | |
| b | 1365636 | 6.7% |
| 5 | 1365636 | 6.7% |
| u | 1365636 | 6.7% |
| : | 910424 | 4.4% |
| 3 | 910424 | 4.4% |
| Other values (8) | 4096908 |
institutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | USNM |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | USNM |
| Value | Count | Frequency (%) |
| usnm | 455212 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 455212 | |
| S | 455212 | |
| N | 455212 | |
| M | 455212 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1820848 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 455212 | |
| S | 455212 | |
| N | 455212 | |
| M | 455212 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1820848 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 455212 | |
| S | 455212 | |
| N | 455212 | |
| M | 455212 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1820848 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 455212 | |
| S | 455212 | |
| N | 455212 | |
| M | 455212 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FISH |
|---|---|
| 2nd row | FISH |
| 3rd row | FISH |
| 4th row | FISH |
| 5th row | FISH |
| Value | Count | Frequency (%) |
| fish | 455212 |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 455212 | |
| I | 455212 | |
| S | 455212 | |
| H | 455212 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1820848 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 455212 | |
| I | 455212 | |
| S | 455212 | |
| H | 455212 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1820848 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 455212 | |
| I | 455212 | |
| S | 455212 | |
| H | 455212 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1820848 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| F | 455212 | |
| I | 455212 | |
| S | 455212 | |
| H | 455212 |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Extant Biology |
|---|---|
| 2nd row | NMNH Extant Biology |
| 3rd row | NMNH Extant Biology |
| 4th row | NMNH Extant Biology |
| 5th row | NMNH Extant Biology |
| Value | Count | Frequency (%) |
| nmnh | 455212 | |
| extant | 455212 | |
| biology | 455212 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 910424 | 10.5% |
| 910424 | 10.5% | |
| t | 910424 | 10.5% |
| o | 910424 | 10.5% |
| M | 455212 | 5.3% |
| H | 455212 | 5.3% |
| E | 455212 | 5.3% |
| x | 455212 | 5.3% |
| a | 455212 | 5.3% |
| n | 455212 | 5.3% |
| Other values (5) | 2276060 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5007332 | |
| Uppercase Letter | 2731272 | |
| Space Separator | 910424 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 910424 | |
| o | 910424 | |
| x | 455212 | |
| a | 455212 | |
| n | 455212 | |
| i | 455212 | |
| l | 455212 | |
| g | 455212 | |
| y | 455212 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 910424 | |
| M | 455212 | |
| H | 455212 | |
| E | 455212 | |
| B | 455212 |
Space Separator
| Value | Count | Frequency (%) |
| 910424 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7738604 | |
| Common | 910424 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 910424 | |
| t | 910424 | |
| o | 910424 | |
| M | 455212 | 5.9% |
| H | 455212 | 5.9% |
| E | 455212 | 5.9% |
| x | 455212 | 5.9% |
| a | 455212 | 5.9% |
| n | 455212 | 5.9% |
| B | 455212 | 5.9% |
| Other values (4) | 1820848 |
Common
| Value | Count | Frequency (%) |
| 910424 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8649028 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 910424 | 10.5% |
| 910424 | 10.5% | |
| t | 910424 | 10.5% |
| o | 910424 | 10.5% |
| M | 455212 | 5.3% |
| H | 455212 | 5.3% |
| E | 455212 | 5.3% |
| x | 455212 | 5.3% |
| a | 455212 | 5.3% |
| n | 455212 | 5.3% |
| Other values (5) | 2276060 |
basisOfRecord
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 18 |
| Mean length | 18.08130717 |
| Min length | 18 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESERVED_SPECIMEN |
|---|---|
| 2nd row | PRESERVED_SPECIMEN |
| 3rd row | PRESERVED_SPECIMEN |
| 4th row | PRESERVED_SPECIMEN |
| 5th row | MACHINE_OBSERVATION |
| Value | Count | Frequency (%) |
| preserved_specimen | 418200 | |
| machine_observation | 37012 | 8.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 2165024 | |
| R | 873412 | |
| S | 873412 | |
| P | 836400 | 10.2% |
| I | 492224 | 6.0% |
| N | 492224 | 6.0% |
| V | 455212 | 5.5% |
| _ | 455212 | 5.5% |
| C | 455212 | 5.5% |
| M | 455212 | 5.5% |
| Other values (6) | 677284 | 8.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7775616 | |
| Connector Punctuation | 455212 | 5.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2165024 | |
| R | 873412 | |
| S | 873412 | |
| P | 836400 | 10.8% |
| I | 492224 | 6.3% |
| N | 492224 | 6.3% |
| V | 455212 | 5.9% |
| C | 455212 | 5.9% |
| M | 455212 | 5.9% |
| D | 418200 | 5.4% |
| Other values (5) | 259084 | 3.3% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 455212 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7775616 | |
| Common | 455212 | 5.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 2165024 | |
| R | 873412 | |
| S | 873412 | |
| P | 836400 | 10.8% |
| I | 492224 | 6.3% |
| N | 492224 | 6.3% |
| V | 455212 | 5.9% |
| C | 455212 | 5.9% |
| M | 455212 | 5.9% |
| D | 418200 | 5.4% |
| Other values (5) | 259084 | 3.3% |
Common
| Value | Count | Frequency (%) |
| _ | 455212 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8230828 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 2165024 | |
| R | 873412 | |
| S | 873412 | |
| P | 836400 | 10.2% |
| I | 492224 | 6.0% |
| N | 492224 | 6.0% |
| V | 455212 | 5.5% |
| _ | 455212 | 5.5% |
| C | 455212 | 5.5% |
| M | 455212 | 5.5% |
| Other values (6) | 677284 | 8.2% |
occurrenceID
Text
Unique 
| Distinct | 455212 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 455212 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/30002bab5-5433-4b6c-8496-286a4a697fd7 |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/3000315ff-b613-4f47-813c-5c48d8e0a883 |
| 3rd row | http://n2t.net/ark:/65665/3ebef4ab3-d946-4961-9221-c7c9692640f8 |
| 4th row | http://n2t.net/ark:/65665/3000bbb81-e139-47f8-b2bc-db762804769d |
| 5th row | http://n2t.net/ark:/65665/3002333ca-4702-4d0d-93cd-265885eff56a |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/30002bab5-5433-4b6c-8496-286a4a697fd7 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ec95962f-5e2d-41b1-9854-38d77fc2256f | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300319b93-d6b8-4b79-b23d-d3825483b706 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ec168a54-17b9-4a71-9ecc-92d446311c64 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300370a00-b7af-441b-87cd-9c14a7b5b464 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ec2b37a7-5b92-4aa2-ad75-a247e8e353f8 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ec3b1e55-b813-49b8-83c7-eadc9323514e | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ec3f4a3d-f79a-4cba-b8e4-022b57aa27d6 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3ec57b74b-7fc8-4006-ad93-945fb0784573 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/30280d648-bf0c-4907-af3e-5cdc58054b4e | 1 | < 0.1% |
| Other values (455202) | 455202 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 2276060 | 7.9% |
| 6 | 2219127 | 7.7% |
| - | 1820848 | 6.3% |
| t | 1820848 | 6.3% |
| 5 | 1762661 | 6.1% |
| a | 1423503 | 5.0% |
| 2 | 1308759 | 4.6% |
| 4 | 1308747 | 4.6% |
| e | 1308683 | 4.6% |
| 3 | 1308534 | 4.6% |
| Other values (16) | 12120586 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12402148 | |
| Lowercase Letter | 10813664 | |
| Other Punctuation | 3641696 | 12.7% |
| Dash Punctuation | 1820848 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1820848 | |
| a | 1423503 | |
| e | 1308683 | |
| b | 969407 | |
| n | 910424 | |
| f | 853663 | |
| d | 853473 | |
| c | 852815 | |
| k | 455212 | 4.2% |
| r | 455212 | 4.2% |
| Other values (2) | 910424 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2219127 | |
| 5 | 1762661 | |
| 2 | 1308759 | |
| 4 | 1308747 | |
| 3 | 1308534 | |
| 9 | 967258 | |
| 8 | 966762 | |
| 0 | 854227 | 6.9% |
| 1 | 853581 | 6.9% |
| 7 | 852492 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 2276060 | |
| : | 910424 | 25.0% |
| . | 455212 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1820848 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17864692 | |
| Latin | 10813664 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 2276060 | |
| 6 | 2219127 | |
| - | 1820848 | |
| 5 | 1762661 | |
| 2 | 1308759 | |
| 4 | 1308747 | |
| 3 | 1308534 | |
| 9 | 967258 | 5.4% |
| 8 | 966762 | 5.4% |
| : | 910424 | 5.1% |
| Other values (4) | 3015512 |
Latin
| Value | Count | Frequency (%) |
| t | 1820848 | |
| a | 1423503 | |
| e | 1308683 | |
| b | 969407 | |
| n | 910424 | |
| f | 853663 | |
| d | 853473 | |
| c | 852815 | |
| k | 455212 | 4.2% |
| r | 455212 | 4.2% |
| Other values (2) | 910424 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28678356 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 2276060 | 7.9% |
| 6 | 2219127 | 7.7% |
| - | 1820848 | 6.3% |
| t | 1820848 | 6.3% |
| 5 | 1762661 | 6.1% |
| a | 1423503 | 5.0% |
| 2 | 1308759 | 4.6% |
| 4 | 1308747 | 4.6% |
| e | 1308683 | 4.6% |
| 3 | 1308534 | 4.6% |
| Other values (16) | 12120586 |
catalogNumber
Text
| Distinct | 455204 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 11 |
| Mean length | 11.04621613 |
| Min length | 6 |
Unique
| Unique | 455199 ? |
|---|---|
| Unique (%) | > 99.9% |
Sample
| 1st row | USNM 51082 |
|---|---|
| 2nd row | USNM 110432 |
| 3rd row | USNM 49860 |
| 4th row | USNM 239751 |
| 5th row | USNM RAD122557 |
| Value | Count | Frequency (%) |
| usnm | 455209 | |
| 465983 | 2 | < 0.1% |
| 466814 | 2 | < 0.1% |
| 135878 | 2 | < 0.1% |
| 114351 | 2 | < 0.1% |
| rad125895 | 2 | < 0.1% |
| fin30680 | 1 | < 0.1% |
| 253658 | 1 | < 0.1% |
| 457486 | 1 | < 0.1% |
| 97025 | 1 | < 0.1% |
| Other values (455195) | 455195 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 464924 | 9.2% |
| U | 455209 | 9.1% |
| M | 455209 | 9.1% |
| 455209 | 9.1% | |
| S | 455209 | 9.1% |
| 1 | 348722 | 6.9% |
| 2 | 335206 | 6.7% |
| 3 | 330422 | 6.6% |
| 4 | 286516 | 5.7% |
| 6 | 228754 | 4.5% |
| Other values (10) | 1212957 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2644490 | |
| Uppercase Letter | 1928638 | |
| Space Separator | 455209 | 9.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 348722 | |
| 2 | 335206 | |
| 3 | 330422 | |
| 4 | 286516 | |
| 6 | 228754 | |
| 0 | 227699 | |
| 7 | 226592 | |
| 5 | 223329 | |
| 9 | 219353 | |
| 8 | 217897 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 464924 | |
| U | 455209 | |
| M | 455209 | |
| S | 455209 | |
| D | 26219 | 1.4% |
| A | 26219 | 1.4% |
| R | 26219 | 1.4% |
| F | 9715 | 0.5% |
| I | 9715 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 455209 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3099699 | |
| Latin | 1928638 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 455209 | ||
| 1 | 348722 | |
| 2 | 335206 | |
| 3 | 330422 | |
| 4 | 286516 | |
| 6 | 228754 | |
| 0 | 227699 | |
| 7 | 226592 | |
| 5 | 223329 | |
| 9 | 219353 |
Latin
| Value | Count | Frequency (%) |
| N | 464924 | |
| U | 455209 | |
| M | 455209 | |
| S | 455209 | |
| D | 26219 | 1.4% |
| A | 26219 | 1.4% |
| R | 26219 | 1.4% |
| F | 9715 | 0.5% |
| I | 9715 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5028337 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 464924 | 9.2% |
| U | 455209 | 9.1% |
| M | 455209 | 9.1% |
| 455209 | 9.1% | |
| S | 455209 | 9.1% |
| 1 | 348722 | 6.9% |
| 2 | 335206 | 6.7% |
| 3 | 330422 | 6.6% |
| 4 | 286516 | 5.7% |
| 6 | 228754 | 4.5% |
| Other values (10) | 1212957 |
recordNumber
Text
Missing 
| Distinct | 20814 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 434386 |
| Missing (%) | 95.4% |
| Memory size | 3.5 MiB |
Length
| Max length | 42 |
|---|---|
| Median length | 8 |
| Mean length | 8.388456737 |
| Min length | 1 |
Unique
| Unique | 20803 ? |
|---|---|
| Unique (%) | 99.9% |
Sample
| 1st row | PHISH-032 |
|---|---|
| 2nd row | AUST-251 |
| 3rd row | MOC11646 |
| 4th row | RP-202 |
| 5th row | SCIL-052 |
| Value | Count | Frequency (%) |
| blz | 1430 | 5.5% |
| bah | 710 | 2.8% |
| tci | 681 | 2.6% |
| sms | 536 | 2.1% |
| cur | 426 | 1.7% |
| tob | 393 | 1.5% |
| twn | 280 | 1.1% |
| hbb | 157 | 0.6% |
| fcc | 146 | 0.6% |
| keb&mgg | 111 | 0.4% |
| Other values (18988) | 20921 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 16672 | 9.5% |
| 0 | 13195 | 7.6% |
| - | 11059 | 6.3% |
| 2 | 9655 | 5.5% |
| 3 | 7495 | 4.3% |
| 7 | 6732 | 3.9% |
| 4 | 6594 | 3.8% |
| 9 | 6304 | 3.6% |
| S | 6075 | 3.5% |
| I | 5691 | 3.3% |
| Other values (54) | 85226 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 83146 | |
| Uppercase Letter | 68573 | |
| Dash Punctuation | 11059 | 6.3% |
| Lowercase Letter | 6363 | 3.6% |
| Space Separator | 4965 | 2.8% |
| Connector Punctuation | 199 | 0.1% |
| Other Punctuation | 135 | 0.1% |
| Close Punctuation | 129 | 0.1% |
| Open Punctuation | 129 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 6075 | 8.9% |
| I | 5691 | 8.3% |
| H | 4851 | 7.1% |
| C | 4741 | 6.9% |
| R | 4572 | 6.7% |
| B | 4223 | 6.2% |
| P | 3828 | 5.6% |
| L | 3820 | 5.6% |
| M | 3805 | 5.5% |
| U | 3666 | 5.3% |
| Other values (16) | 23301 |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1435 | |
| m | 1341 | |
| b | 1290 | |
| o | 1289 | |
| n | 256 | 4.0% |
| a | 147 | 2.3% |
| y | 140 | 2.2% |
| u | 138 | 2.2% |
| q | 136 | 2.1% |
| t | 133 | 2.1% |
| Other values (7) | 58 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 16672 | |
| 0 | 13195 | |
| 2 | 9655 | |
| 3 | 7495 | |
| 7 | 6732 | |
| 4 | 6594 | 7.9% |
| 9 | 6304 | 7.6% |
| 8 | 5653 | 6.8% |
| 5 | 5523 | 6.6% |
| 6 | 5323 | 6.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 111 | |
| ; | 9 | 6.7% |
| . | 9 | 6.7% |
| * | 4 | 3.0% |
| : | 1 | 0.7% |
| ? | 1 | 0.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11059 |
Space Separator
| Value | Count | Frequency (%) |
| 4965 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 199 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 129 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 129 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 99762 | |
| Latin | 74936 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 6075 | 8.1% |
| I | 5691 | 7.6% |
| H | 4851 | 6.5% |
| C | 4741 | 6.3% |
| R | 4572 | 6.1% |
| B | 4223 | 5.6% |
| P | 3828 | 5.1% |
| L | 3820 | 5.1% |
| M | 3805 | 5.1% |
| U | 3666 | 4.9% |
| Other values (33) | 29664 |
Common
| Value | Count | Frequency (%) |
| 1 | 16672 | |
| 0 | 13195 | |
| - | 11059 | |
| 2 | 9655 | |
| 3 | 7495 | |
| 7 | 6732 | |
| 4 | 6594 | 6.6% |
| 9 | 6304 | 6.3% |
| 8 | 5653 | 5.7% |
| 5 | 5523 | 5.5% |
| Other values (11) | 10880 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 174698 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 16672 | 9.5% |
| 0 | 13195 | 7.6% |
| - | 11059 | 6.3% |
| 2 | 9655 | 5.5% |
| 3 | 7495 | 4.3% |
| 7 | 6732 | 3.9% |
| 4 | 6594 | 3.8% |
| 9 | 6304 | 3.6% |
| S | 6075 | 3.5% |
| I | 5691 | 3.3% |
| Other values (54) | 85226 |
recordedBy
Text
Missing 
| Distinct | 7883 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 287312 |
| Missing (%) | 63.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 240 |
|---|---|
| Median length | 115 |
| Mean length | 26.11823109 |
| Min length | 1 |
Unique
| Unique | 3022 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | J. Snyder |
|---|---|
| 2nd row | D. Richardson |
| 3rd row | Smithsonian Team, A. Alcala & Silliman University Group |
| 4th row | Bronson |
| 5th row | G. Hendler |
| Value | Count | Frequency (%) |
| 77670 | 9.1% | |
| j | 42738 | 5.0% |
| m | 36874 | 4.3% |
| d | 28293 | 3.3% |
| r | 27606 | 3.2% |
| c | 22145 | 2.6% |
| l | 20146 | 2.3% |
| h | 19636 | 2.3% |
| s | 18374 | 2.1% |
| a | 17770 | 2.1% |
| Other values (4981) | 546427 |
Most occurring characters
| Value | Count | Frequency (%) |
| 689779 | ||
| . | 354996 | 8.1% |
| e | 287100 | 6.5% |
| a | 267898 | 6.1% |
| r | 202145 | 4.6% |
| n | 201834 | 4.6% |
| i | 198853 | 4.5% |
| o | 170908 | 3.9% |
| l | 161879 | 3.7% |
| t | 156528 | 3.6% |
| Other values (66) | 1693331 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2340145 | |
| Uppercase Letter | 780626 | 17.8% |
| Space Separator | 689779 | 15.7% |
| Other Punctuation | 562437 | 12.8% |
| Dash Punctuation | 6626 | 0.2% |
| Open Punctuation | 2738 | 0.1% |
| Close Punctuation | 2738 | 0.1% |
| Decimal Number | 162 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 287100 | |
| a | 267898 | |
| r | 202145 | |
| n | 201834 | |
| i | 198853 | |
| o | 170908 | 7.3% |
| l | 161879 | 6.9% |
| t | 156528 | 6.7% |
| s | 132876 | 5.7% |
| h | 80006 | 3.4% |
| Other values (19) | 480118 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 78820 | 10.1% |
| S | 64727 | 8.3% |
| C | 58544 | 7.5% |
| B | 55964 | 7.2% |
| J | 53696 | 6.9% |
| R | 51837 | 6.6% |
| H | 41812 | 5.4% |
| P | 41068 | 5.3% |
| D | 40847 | 5.2% |
| W | 39861 | 5.1% |
| Other values (16) | 253450 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 39 | |
| 0 | 26 | |
| 1 | 21 | |
| 8 | 21 | |
| 3 | 17 | |
| 2 | 16 | |
| 7 | 11 | 6.8% |
| 6 | 5 | 3.1% |
| 4 | 5 | 3.1% |
| 5 | 1 | 0.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 354996 | |
| , | 129603 | 23.0% |
| & | 77581 | 13.8% |
| ' | 240 | < 0.1% |
| # | 10 | < 0.1% |
| ? | 5 | < 0.1% |
| / | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 689779 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6626 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2738 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2738 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3120771 | |
| Common | 1264480 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 287100 | 9.2% |
| a | 267898 | 8.6% |
| r | 202145 | 6.5% |
| n | 201834 | 6.5% |
| i | 198853 | 6.4% |
| o | 170908 | 5.5% |
| l | 161879 | 5.2% |
| t | 156528 | 5.0% |
| s | 132876 | 4.3% |
| h | 80006 | 2.6% |
| Other values (45) | 1260744 |
Common
| Value | Count | Frequency (%) |
| 689779 | ||
| . | 354996 | |
| , | 129603 | 10.2% |
| & | 77581 | 6.1% |
| - | 6626 | 0.5% |
| ( | 2738 | 0.2% |
| ) | 2738 | 0.2% |
| ' | 240 | < 0.1% |
| 9 | 39 | < 0.1% |
| 0 | 26 | < 0.1% |
| Other values (11) | 114 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4385214 | |
| None | 37 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 689779 | ||
| . | 354996 | 8.1% |
| e | 287100 | 6.5% |
| a | 267898 | 6.1% |
| r | 202145 | 4.6% |
| n | 201834 | 4.6% |
| i | 198853 | 4.5% |
| o | 170908 | 3.9% |
| l | 161879 | 3.7% |
| t | 156528 | 3.6% |
| Other values (63) | 1693294 |
None
| Value | Count | Frequency (%) |
| ü | 32 | |
| ô | 4 | 10.8% |
| í | 1 | 2.7% |
individualCount
Text
| Distinct | 619 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 15 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 1 |
| Mean length | 1.121072854 |
| Min length | 1 |
Unique
| Unique | 251 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 9 |
| 4th row | 12 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 245716 | |
| 2 | 61709 | 13.6% |
| 3 | 30726 | 6.8% |
| 4 | 19436 | 4.3% |
| 5 | 14092 | 3.1% |
| 6 | 10260 | 2.3% |
| 7 | 7454 | 1.6% |
| 10 | 6775 | 1.5% |
| 8 | 6055 | 1.3% |
| 9 | 4855 | 1.1% |
| Other values (609) | 48119 | 10.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 279516 | |
| 2 | 78463 | 15.4% |
| 3 | 39933 | 7.8% |
| 4 | 26278 | 5.1% |
| 5 | 23283 | 4.6% |
| 0 | 18677 | 3.7% |
| 6 | 14874 | 2.9% |
| 7 | 11614 | 2.3% |
| 8 | 9547 | 1.9% |
| 9 | 8124 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 510309 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 279516 | |
| 2 | 78463 | 15.4% |
| 3 | 39933 | 7.8% |
| 4 | 26278 | 5.1% |
| 5 | 23283 | 4.6% |
| 0 | 18677 | 3.7% |
| 6 | 14874 | 2.9% |
| 7 | 11614 | 2.3% |
| 8 | 9547 | 1.9% |
| 9 | 8124 | 1.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 510309 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 279516 | |
| 2 | 78463 | 15.4% |
| 3 | 39933 | 7.8% |
| 4 | 26278 | 5.1% |
| 5 | 23283 | 4.6% |
| 0 | 18677 | 3.7% |
| 6 | 14874 | 2.9% |
| 7 | 11614 | 2.3% |
| 8 | 9547 | 1.9% |
| 9 | 8124 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 510309 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 279516 | |
| 2 | 78463 | 15.4% |
| 3 | 39933 | 7.8% |
| 4 | 26278 | 5.1% |
| 5 | 23283 | 4.6% |
| 0 | 18677 | 3.7% |
| 6 | 14874 | 2.9% |
| 7 | 11614 | 2.3% |
| 8 | 9547 | 1.9% |
| 9 | 8124 | 1.6% |
sex
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 33.3% |
| Missing | 455209 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MALE |
|---|---|
| 2nd row | MALE |
| 3rd row | MALE |
| Value | Count | Frequency (%) |
| male | 3 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 3 | |
| A | 3 | |
| L | 3 | |
| E | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 12 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 3 | |
| A | 3 | |
| L | 3 | |
| E | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 3 | |
| A | 3 | |
| L | 3 | |
| E | 3 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 3 | |
| A | 3 | |
| L | 3 | |
| E | 3 |
occurrenceStatus
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.999995606 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESENT |
|---|---|
| 2nd row | PRESENT |
| 3rd row | PRESENT |
| 4th row | PRESENT |
| 5th row | PRESENT |
| Value | Count | Frequency (%) |
| present | 455210 | |
| absent | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 910422 | |
| S | 455212 | |
| N | 455212 | |
| T | 455212 | |
| P | 455210 | |
| R | 455210 | |
| A | 2 | < 0.1% |
| B | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3186482 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 910422 | |
| S | 455212 | |
| N | 455212 | |
| T | 455212 | |
| P | 455210 | |
| R | 455210 | |
| A | 2 | < 0.1% |
| B | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3186482 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 910422 | |
| S | 455212 | |
| N | 455212 | |
| T | 455212 | |
| P | 455210 | |
| R | 455210 | |
| A | 2 | < 0.1% |
| B | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3186482 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 910422 | |
| S | 455212 | |
| N | 455212 | |
| T | 455212 | |
| P | 455210 | |
| R | 455210 | |
| A | 2 | < 0.1% |
| B | 2 | < 0.1% |
preparations
Text
Missing 
| Distinct | 325 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 346184 |
| Missing (%) | 76.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 255 |
|---|---|
| Median length | 192 |
| Mean length | 11.80351836 |
| Min length | 4 |
Unique
| Unique | 141 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Dry Osteological Specimen |
|---|---|
| 2nd row | Glycerin with Bone Stain |
| 3rd row | Polyester |
| 4th row | Larvae [ETOH Fixed] |
| 5th row | Unknown |
| Value | Count | Frequency (%) |
| larvae | 25640 | |
| polyester | 20066 | 11.4% |
| photograph | 14070 | 8.0% |
| unknown | 11506 | 6.6% |
| film | 9617 | 5.5% |
| specimen | 8056 | 4.6% |
| osteological | 7025 | 4.0% |
| glycerin | 7019 | 4.0% |
| with | 7017 | 4.0% |
| stain | 7012 | 4.0% |
| Other values (60) | 58274 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 130540 | 10.1% |
| a | 117241 | 9.1% |
| o | 94955 | 7.4% |
| r | 91527 | 7.1% |
| t | 83181 | 6.5% |
| n | 69625 | 5.4% |
| l | 66760 | 5.2% |
| i | 66635 | 5.2% |
| 66274 | 5.1% | |
| h | 41878 | 3.3% |
| Other values (46) | 458298 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1023240 | |
| Uppercase Letter | 182950 | 14.2% |
| Space Separator | 66274 | 5.1% |
| Other Punctuation | 6530 | 0.5% |
| Open Punctuation | 3927 | 0.3% |
| Close Punctuation | 3927 | 0.3% |
| Dash Punctuation | 60 | < 0.1% |
| Decimal Number | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 130540 | |
| a | 117241 | |
| o | 94955 | |
| r | 91527 | |
| t | 83181 | 8.1% |
| n | 69625 | 6.8% |
| l | 66760 | 6.5% |
| i | 66635 | 6.5% |
| h | 41878 | 4.1% |
| y | 35275 | 3.4% |
| Other values (13) | 225623 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 34352 | |
| L | 26509 | |
| S | 15784 | |
| F | 15061 | |
| O | 12528 | 6.8% |
| U | 11506 | 6.3% |
| D | 9088 | 5.0% |
| E | 7134 | 3.9% |
| G | 7019 | 3.8% |
| A | 6981 | 3.8% |
| Other values (11) | 36988 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 6172 | |
| . | 186 | 2.8% |
| & | 89 | 1.4% |
| , | 78 | 1.2% |
| % | 5 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 4 | |
| 9 | 1 | 16.7% |
| 5 | 1 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| 66274 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 3927 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 3927 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 60 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1206190 | |
| Common | 80724 | 6.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 130540 | 10.8% |
| a | 117241 | 9.7% |
| o | 94955 | 7.9% |
| r | 91527 | 7.6% |
| t | 83181 | 6.9% |
| n | 69625 | 5.8% |
| l | 66760 | 5.5% |
| i | 66635 | 5.5% |
| h | 41878 | 3.5% |
| y | 35275 | 2.9% |
| Other values (34) | 408573 |
Common
| Value | Count | Frequency (%) |
| 66274 | ||
| ; | 6172 | 7.6% |
| [ | 3927 | 4.9% |
| ] | 3927 | 4.9% |
| . | 186 | 0.2% |
| & | 89 | 0.1% |
| , | 78 | 0.1% |
| - | 60 | 0.1% |
| % | 5 | < 0.1% |
| 3 | 4 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1286914 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 130540 | 10.1% |
| a | 117241 | 9.1% |
| o | 94955 | 7.4% |
| r | 91527 | 7.1% |
| t | 83181 | 6.5% |
| n | 69625 | 5.4% |
| l | 66760 | 5.2% |
| i | 66635 | 5.2% |
| 66274 | 5.1% | |
| h | 41878 | 3.3% |
| Other values (46) | 458298 |
Missing 
| Distinct | 447 |
|---|---|
| Distinct (%) | 99.3% |
| Missing | 454762 |
| Missing (%) | 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 249 |
|---|---|
| Median length | 49 |
| Mean length | 59.88888889 |
| Min length | 49 |
Unique
| Unique | 444 ? |
|---|---|
| Unique (%) | 98.7% |
Sample
| 1st row | https://www.ncbi.nlm.nih.gov/gquery?term=FJ609901 |
|---|---|
| 2nd row | https://www.ncbi.nlm.nih.gov/gquery?term=HQ600890 |
| 3rd row | https://www.ncbi.nlm.nih.gov/gquery?term=HM748411 |
| 4th row | https://www.ncbi.nlm.nih.gov/gquery?term=HQ600884 |
| 5th row | https://www.ncbi.nlm.nih.gov/gquery?term=MN621852 |
| Value | Count | Frequency (%) |
| https://www.ncbi.nlm.nih.gov/gquery?term=mn549761 | 2 | 0.4% |
| https://www.ncbi.nlm.nih.gov/gquery?term=hq543050 | 2 | 0.4% |
| https://www.ncbi.nlm.nih.gov/gquery?term=hq543049 | 2 | 0.4% |
| https://www.ncbi.nlm.nih.gov/gquery?term=gq367323 | 1 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=hq543043 | 1 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=hq600884 | 1 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=mn621852 | 1 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=hq325698;https://www.ncbi.nlm.nih.gov/gquery?term=hq325631 | 1 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=hm748370 | 1 | 0.2% |
| https://www.ncbi.nlm.nih.gov/gquery?term=ef536294;https://www.ncbi.nlm.nih.gov/gquery?term=ef536256;https://www.ncbi.nlm.nih.gov/gquery?term=ef539241;https://www.ncbi.nlm.nih.gov/gquery?term=ef533917;https://www.ncbi.nlm.nih.gov/gquery?term=ef530094 | 1 | 0.2% |
| Other values (437) | 437 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 2192 | 8.1% |
| t | 1644 | 6.1% |
| / | 1644 | 6.1% |
| w | 1644 | 6.1% |
| n | 1644 | 6.1% |
| h | 1096 | 4.1% |
| r | 1096 | 4.1% |
| e | 1096 | 4.1% |
| i | 1096 | 4.1% |
| m | 1096 | 4.1% |
| Other values (41) | 12702 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16988 | |
| Other Punctuation | 5030 | 18.7% |
| Decimal Number | 3288 | 12.2% |
| Uppercase Letter | 1096 | 4.1% |
| Math Symbol | 548 | 2.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1644 | 9.7% |
| w | 1644 | 9.7% |
| n | 1644 | 9.7% |
| h | 1096 | 6.5% |
| r | 1096 | 6.5% |
| e | 1096 | 6.5% |
| i | 1096 | 6.5% |
| m | 1096 | 6.5% |
| g | 1096 | 6.5% |
| v | 548 | 3.2% |
| Other values (9) | 4932 |
Uppercase Letter
| Value | Count | Frequency (%) |
| Q | 219 | |
| H | 165 | |
| F | 141 | |
| M | 122 | |
| G | 99 | |
| J | 85 | 7.8% |
| A | 83 | 7.6% |
| N | 75 | 6.8% |
| Y | 30 | 2.7% |
| E | 29 | 2.6% |
| Other values (6) | 48 | 4.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 439 | |
| 0 | 417 | |
| 3 | 406 | |
| 4 | 396 | |
| 7 | 382 | |
| 9 | 352 | |
| 5 | 289 | |
| 8 | 261 | |
| 2 | 201 | |
| 1 | 145 | 4.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2192 | |
| / | 1644 | |
| ? | 548 | 10.9% |
| : | 548 | 10.9% |
| ; | 98 | 1.9% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 548 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18084 | |
| Common | 8866 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1644 | 9.1% |
| w | 1644 | 9.1% |
| n | 1644 | 9.1% |
| h | 1096 | 6.1% |
| r | 1096 | 6.1% |
| e | 1096 | 6.1% |
| i | 1096 | 6.1% |
| m | 1096 | 6.1% |
| g | 1096 | 6.1% |
| v | 548 | 3.0% |
| Other values (25) | 6028 |
Common
| Value | Count | Frequency (%) |
| . | 2192 | |
| / | 1644 | |
| = | 548 | 6.2% |
| ? | 548 | 6.2% |
| : | 548 | 6.2% |
| 6 | 439 | 5.0% |
| 0 | 417 | 4.7% |
| 3 | 406 | 4.6% |
| 4 | 396 | 4.5% |
| 7 | 382 | 4.3% |
| Other values (6) | 1346 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26950 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 2192 | 8.1% |
| t | 1644 | 6.1% |
| / | 1644 | 6.1% |
| w | 1644 | 6.1% |
| n | 1644 | 6.1% |
| h | 1096 | 4.1% |
| r | 1096 | 4.1% |
| e | 1096 | 4.1% |
| i | 1096 | 4.1% |
| m | 1096 | 4.1% |
| Other values (41) | 12702 |
Missing 
| Distinct | 80318 |
|---|---|
| Distinct (%) | 48.8% |
| Missing | 290485 |
| Missing (%) | 63.8% |
| Memory size | 3.5 MiB |
Length
| Max length | 220700 |
|---|---|
| Median length | 28061 |
| Mean length | 75.7703473 |
| Min length | 1 |
Unique
| Unique | 70650 ? |
|---|---|
| Unique (%) | 42.9% |
Sample
| 1st row | Note in ledger: " pair of otoliths"; Otoliths are stored in the Osteo Collection.; Stored in Osteo Collection.; The ototliths are stored in Mugil Box 1 of 1, which contains catalog numbers: 110428, 110429, 110430, 110431, 110432, 110433, 110434, 110435, 110436, 110438, 110439, 110440, and 110441. |
|---|---|
| 2nd row | Cat. no. 105 |
| 3rd row | Host-bohadschia argus. rec from: truett, d. f. |
| 4th row | Specimen measurements as written on the slide mount: SL (mm)= 205; TL (mm)= 10" (254); This material is part of the John and Helen Randall Slide Collection. The slides were digitized October 2017. The Randall donation includes all intellectual property rights.; Black paint/goop on the film. Not obscuring specimen. |
| 5th row | Specimen measurements as written on the slide mount: SL (mm)= 57; TL (mm)= 2.8" (71); This material is part of the John and Helen Randall Slide Collection. The slides were digitized October 2017. The Randall donation includes all intellectual property rights. |
| Value | Count | Frequency (%) |
| the | 73553 | 3.9% |
| of | 50804 | 2.7% |
| in | 34960 | 1.8% |
| and | 29482 | 1.6% |
| mm | 26673 | 1.4% |
| collection | 24234 | 1.3% |
| specimen | 23152 | 1.2% |
| as | 22917 | 1.2% |
| is | 22746 | 1.2% |
| 1 | 22640 | 1.2% |
| Other values (81634) | 1569990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1609957 | 12.9% | |
| e | 933339 | 7.5% |
| a | 635649 | 5.1% |
| i | 628891 | 5.0% |
| t | 615692 | 4.9% |
| n | 591012 | 4.7% |
| o | 588776 | 4.7% |
| s | 465772 | 3.7% |
| l | 461424 | 3.7% |
| r | 452874 | 3.6% |
| Other values (112) | 5498036 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7643238 | |
| Space Separator | 1609957 | 12.9% |
| Decimal Number | 1182431 | 9.5% |
| Uppercase Letter | 932071 | 7.5% |
| Other Punctuation | 506745 | 4.1% |
| Control | 419081 | 3.4% |
| Dash Punctuation | 70233 | 0.6% |
| Open Punctuation | 32326 | 0.3% |
| Close Punctuation | 32314 | 0.3% |
| Math Symbol | 27221 | 0.2% |
| Other values (5) | 25805 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 933339 | |
| a | 635649 | 8.3% |
| i | 628891 | 8.2% |
| t | 615692 | 8.1% |
| n | 591012 | 7.7% |
| o | 588776 | 7.7% |
| s | 465772 | 6.1% |
| l | 461424 | 6.0% |
| r | 452874 | 5.9% |
| d | 337046 | 4.4% |
| Other values (31) | 1932763 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 114395 | |
| T | 93129 | 10.0% |
| N | 71458 | 7.7% |
| C | 67099 | 7.2% |
| R | 62596 | 6.7% |
| O | 56902 | 6.1% |
| E | 53897 | 5.8% |
| L | 53620 | 5.8% |
| A | 49927 | 5.4% |
| M | 48274 | 5.2% |
| Other values (22) | 260774 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 262249 | |
| , | 80367 | 15.9% |
| : | 61012 | 12.0% |
| ; | 45848 | 9.0% |
| " | 20083 | 4.0% |
| / | 19295 | 3.8% |
| ' | 6995 | 1.4% |
| # | 6654 | 1.3% |
| & | 2738 | 0.5% |
| ? | 808 | 0.2% |
| Other values (6) | 696 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 204458 | |
| 2 | 172093 | |
| 0 | 154696 | |
| 3 | 116867 | |
| 4 | 100430 | |
| 9 | 99868 | |
| 5 | 94414 | |
| 8 | 81370 | 6.9% |
| 7 | 80164 | 6.8% |
| 6 | 78071 | 6.6% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 25537 | |
| + | 1644 | 6.0% |
| ~ | 31 | 0.1% |
| < | 4 | < 0.1% |
| > | 4 | < 0.1% |
| | | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 32094 | |
| [ | 214 | 0.7% |
| { | 18 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 32084 | |
| ] | 214 | 0.7% |
| } | 16 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| 417203 | ||
| 1878 | 0.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 69691 | |
| – | 542 | 0.8% |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 35 | |
| › | 17 |
Space Separator
| Value | Count | Frequency (%) |
| 1609957 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 25725 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 23 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 3 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ^ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8575309 | |
| Common | 3906113 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 933339 | 10.9% |
| a | 635649 | 7.4% |
| i | 628891 | 7.3% |
| t | 615692 | 7.2% |
| n | 591012 | 6.9% |
| o | 588776 | 6.9% |
| s | 465772 | 5.4% |
| l | 461424 | 5.4% |
| r | 452874 | 5.3% |
| d | 337046 | 3.9% |
| Other values (63) | 2864834 |
Common
| Value | Count | Frequency (%) |
| 1609957 | ||
| 417203 | 10.7% | |
| . | 262249 | 6.7% |
| 1 | 204458 | 5.2% |
| 2 | 172093 | 4.4% |
| 0 | 154696 | 4.0% |
| 3 | 116867 | 3.0% |
| 4 | 100430 | 2.6% |
| 9 | 99868 | 2.6% |
| 5 | 94414 | 2.4% |
| Other values (39) | 673878 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12479964 | |
| None | 841 | < 0.1% |
| Punctuation | 617 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1609957 | 12.9% | |
| e | 933339 | 7.5% |
| a | 635649 | 5.1% |
| i | 628891 | 5.0% |
| t | 615692 | 4.9% |
| n | 591012 | 4.7% |
| o | 588776 | 4.7% |
| s | 465772 | 3.7% |
| l | 461424 | 3.7% |
| r | 452874 | 3.6% |
| Other values (86) | 5496578 |
Punctuation
| Value | Count | Frequency (%) |
| – | 542 | |
| ” | 35 | 5.7% |
| “ | 23 | 3.7% |
| › | 17 | 2.8% |
None
| Value | Count | Frequency (%) |
| ü | 201 | |
| ã | 160 | |
| è | 131 | |
| å | 88 | |
| é | 72 | 8.6% |
| á | 44 | 5.2% |
| ö | 37 | 4.4% |
| ó | 28 | 3.3% |
| í | 21 | 2.5% |
| ê | 21 | 2.5% |
| Other values (12) | 38 | 4.5% |
verbatimLabel
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455209 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.666666667 |
| Min length | 6 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | -9.3883 |
|---|---|
| 2nd row | 10.6925 |
| 3rd row | 7.0083 |
| Value | Count | Frequency (%) |
| 9.3883 | 1 | |
| 10.6925 | 1 | |
| 7.0083 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 3 | |
| 3 | 3 | |
| 8 | 3 | |
| 0 | 3 | |
| 9 | 2 | |
| - | 1 | 5.0% |
| 1 | 1 | 5.0% |
| 6 | 1 | 5.0% |
| 2 | 1 | 5.0% |
| 5 | 1 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16 | |
| Other Punctuation | 3 | 15.0% |
| Dash Punctuation | 1 | 5.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 3 | |
| 8 | 3 | |
| 0 | 3 | |
| 9 | 2 | |
| 1 | 1 | 6.2% |
| 6 | 1 | 6.2% |
| 2 | 1 | 6.2% |
| 5 | 1 | 6.2% |
| 7 | 1 | 6.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 3 | |
| 3 | 3 | |
| 8 | 3 | |
| 0 | 3 | |
| 9 | 2 | |
| - | 1 | 5.0% |
| 1 | 1 | 5.0% |
| 6 | 1 | 5.0% |
| 2 | 1 | 5.0% |
| 5 | 1 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 3 | |
| 3 | 3 | |
| 8 | 3 | |
| 0 | 3 | |
| 9 | 2 | |
| - | 1 | 5.0% |
| 1 | 1 | 5.0% |
| 6 | 1 | 5.0% |
| 2 | 1 | 5.0% |
| 5 | 1 | 5.0% |
materialSampleID
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455209 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 46.2133 |
|---|---|
| 2nd row | 122.563 |
| 3rd row | 158.199 |
| Value | Count | Frequency (%) |
| 46.2133 | 1 | |
| 122.563 | 1 | |
| 158.199 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 4 | |
| . | 3 | |
| 2 | 3 | |
| 3 | 3 | |
| 6 | 2 | |
| 5 | 2 | |
| 9 | 2 | |
| 4 | 1 | 4.8% |
| 8 | 1 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18 | |
| Other Punctuation | 3 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4 | |
| 2 | 3 | |
| 3 | 3 | |
| 6 | 2 | |
| 5 | 2 | |
| 9 | 2 | |
| 4 | 1 | 5.6% |
| 8 | 1 | 5.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 4 | |
| . | 3 | |
| 2 | 3 | |
| 3 | 3 | |
| 6 | 2 | |
| 5 | 2 | |
| 9 | 2 | |
| 4 | 1 | 4.8% |
| 8 | 1 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 4 | |
| . | 3 | |
| 2 | 3 | |
| 3 | 3 | |
| 6 | 2 | |
| 5 | 2 | |
| 9 | 2 | |
| 4 | 1 | 4.8% |
| 8 | 1 | 4.8% |
eventID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455211 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 941.0 |
|---|
| Value | Count | Frequency (%) |
| 941.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 4 | 1 | |
| 1 | 1 | |
| . | 1 | |
| 0 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 | |
| Other Punctuation | 1 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 4 | 1 | |
| 1 | 1 | |
| 0 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 4 | 1 | |
| 1 | 1 | |
| . | 1 | |
| 0 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 4 | 1 | |
| 1 | 1 | |
| . | 1 | |
| 0 | 1 |
fieldNumber
Text
Missing 
| Distinct | 25291 |
|---|---|
| Distinct (%) | 14.0% |
| Missing | 274211 |
| Missing (%) | 60.2% |
| Memory size | 3.5 MiB |
Length
| Max length | 149 |
|---|---|
| Median length | 70 |
| Mean length | 10.07364048 |
| Min length | 1 |
Unique
| Unique | 10523 ? |
|---|---|
| Unique (%) | 5.8% |
Sample
| 1st row | FJS-455 |
|---|---|
| 2nd row | M10-97B4 (40-60m) |
| 3rd row | SP 78-18 |
| 4th row | BBC 1734 A; M-84 |
| 5th row | PHISH-2016-05; SIA-06 |
| Value | Count | Frequency (%) |
| vgs | 19290 | 5.7% |
| jtw | 14298 | 4.2% |
| bbc | 6125 | 1.8% |
| lwk | 4274 | 1.3% |
| lk | 4258 | 1.3% |
| sol | 3414 | 1.0% |
| rpv | 3291 | 1.0% |
| sp | 3134 | 0.9% |
| bayley | 2740 | 0.8% |
| lrp | 2643 | 0.8% |
| Other values (22433) | 275090 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 190028 | 10.4% |
| 157556 | 8.6% | |
| 1 | 125554 | 6.9% |
| 0 | 109950 | 6.0% |
| 2 | 103376 | 5.7% |
| 9 | 89324 | 4.9% |
| 6 | 82127 | 4.5% |
| 7 | 76841 | 4.2% |
| 3 | 73150 | 4.0% |
| 8 | 68610 | 3.8% |
| Other values (72) | 746823 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 855884 | |
| Uppercase Letter | 548142 | |
| Dash Punctuation | 190028 | 10.4% |
| Space Separator | 157556 | 8.6% |
| Other Punctuation | 39951 | 2.2% |
| Lowercase Letter | 29723 | 1.6% |
| Close Punctuation | 995 | 0.1% |
| Open Punctuation | 994 | 0.1% |
| Math Symbol | 62 | < 0.1% |
| Final Punctuation | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 54493 | 9.9% |
| A | 45928 | 8.4% |
| V | 35510 | 6.5% |
| L | 34668 | 6.3% |
| W | 32879 | 6.0% |
| T | 31271 | 5.7% |
| B | 30330 | 5.5% |
| G | 27885 | 5.1% |
| C | 27370 | 5.0% |
| M | 26822 | 4.9% |
| Other values (16) | 200986 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 5688 | |
| t | 3332 | |
| e | 3308 | |
| a | 3141 | |
| r | 2575 | |
| i | 1867 | 6.3% |
| n | 1554 | 5.2% |
| u | 1507 | 5.1% |
| m | 1427 | 4.8% |
| l | 1415 | 4.8% |
| Other values (15) | 3909 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 32939 | |
| . | 4326 | 10.8% |
| , | 1177 | 2.9% |
| # | 917 | 2.3% |
| : | 267 | 0.7% |
| / | 177 | 0.4% |
| & | 60 | 0.2% |
| ' | 43 | 0.1% |
| ? | 31 | 0.1% |
| " | 8 | < 0.1% |
| Other values (2) | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 125554 | |
| 0 | 109950 | |
| 2 | 103376 | |
| 9 | 89324 | |
| 6 | 82127 | |
| 7 | 76841 | |
| 3 | 73150 | |
| 8 | 68610 | |
| 4 | 65361 | |
| 5 | 61591 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 992 | |
| ] | 3 | 0.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 991 | |
| [ | 3 | 0.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 190028 |
Space Separator
| Value | Count | Frequency (%) |
| 157556 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 62 |
Final Punctuation
| Value | Count | Frequency (%) |
| › | 3 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1245474 | |
| Latin | 577865 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 54493 | 9.4% |
| A | 45928 | 7.9% |
| V | 35510 | 6.1% |
| L | 34668 | 6.0% |
| W | 32879 | 5.7% |
| T | 31271 | 5.4% |
| B | 30330 | 5.2% |
| G | 27885 | 4.8% |
| C | 27370 | 4.7% |
| M | 26822 | 4.6% |
| Other values (41) | 230709 |
Common
| Value | Count | Frequency (%) |
| - | 190028 | |
| 157556 | ||
| 1 | 125554 | |
| 0 | 109950 | |
| 2 | 103376 | |
| 9 | 89324 | |
| 6 | 82127 | |
| 7 | 76841 | |
| 3 | 73150 | 5.9% |
| 8 | 68610 | 5.5% |
| Other values (21) | 168958 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1823336 | |
| Punctuation | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 190028 | 10.4% |
| 157556 | 8.6% | |
| 1 | 125554 | 6.9% |
| 0 | 109950 | 6.0% |
| 2 | 103376 | 5.7% |
| 9 | 89324 | 4.9% |
| 6 | 82127 | 4.5% |
| 7 | 76841 | 4.2% |
| 3 | 73150 | 4.0% |
| 8 | 68610 | 3.8% |
| Other values (71) | 746820 |
Punctuation
| Value | Count | Frequency (%) |
| › | 3 |
eventDate
Text
Missing 
| Distinct | 30514 |
|---|---|
| Distinct (%) | 7.7% |
| Missing | 60241 |
| Missing (%) | 13.2% |
| Memory size | 3.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 10.09004712 |
| Min length | 4 |
Unique
| Unique | 8111 ? |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | 1938-03-25 |
|---|---|
| 2nd row | 1956-05-30 |
| 3rd row | 1997-05-10 |
| 4th row | 1978-05-22 |
| 5th row | 1928-02-10 |
| Value | Count | Frequency (%) |
| 1906 | 1477 | 0.4% |
| 1902 | 1141 | 0.3% |
| 1888 | 1112 | 0.3% |
| 1889 | 938 | 0.2% |
| 1994-05-06 | 927 | 0.2% |
| 1994-04-30 | 702 | 0.2% |
| 1901 | 595 | 0.2% |
| 1880 | 568 | 0.1% |
| 1970-09-11/1970-09-16 | 510 | 0.1% |
| 1893 | 440 | 0.1% |
| Other values (30504) | 386561 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 773124 | |
| 1 | 729306 | |
| 0 | 645975 | |
| 9 | 528070 | |
| 2 | 287210 | 7.2% |
| 8 | 212491 | 5.3% |
| 6 | 183136 | 4.6% |
| 7 | 181671 | 4.6% |
| 5 | 151243 | 3.8% |
| 3 | 141325 | 3.5% |
| Other values (2) | 151725 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3194948 | |
| Dash Punctuation | 773124 | 19.4% |
| Other Punctuation | 17204 | 0.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 729306 | |
| 0 | 645975 | |
| 9 | 528070 | |
| 2 | 287210 | 9.0% |
| 8 | 212491 | 6.7% |
| 6 | 183136 | 5.7% |
| 7 | 181671 | 5.7% |
| 5 | 151243 | 4.7% |
| 3 | 141325 | 4.4% |
| 4 | 134521 | 4.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 773124 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 17204 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3985276 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 773124 | |
| 1 | 729306 | |
| 0 | 645975 | |
| 9 | 528070 | |
| 2 | 287210 | 7.2% |
| 8 | 212491 | 5.3% |
| 6 | 183136 | 4.6% |
| 7 | 181671 | 4.6% |
| 5 | 151243 | 3.8% |
| 3 | 141325 | 3.5% |
| Other values (2) | 151725 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3985276 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 773124 | |
| 1 | 729306 | |
| 0 | 645975 | |
| 9 | 528070 | |
| 2 | 287210 | 7.2% |
| 8 | 212491 | 5.3% |
| 6 | 183136 | 4.6% |
| 7 | 181671 | 4.6% |
| 5 | 151243 | 3.8% |
| 3 | 141325 | 3.5% |
| Other values (2) | 151725 | 3.8% |
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 91500 |
| Missing (%) | 20.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.766785259 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 84 |
|---|---|
| 2nd row | 151 |
| 3rd row | 130 |
| 4th row | 142 |
| 5th row | 41 |
| Value | Count | Frequency (%) |
| 126 | 2409 | 0.7% |
| 251 | 1989 | 0.5% |
| 120 | 1928 | 0.5% |
| 117 | 1868 | 0.5% |
| 146 | 1854 | 0.5% |
| 159 | 1853 | 0.5% |
| 143 | 1786 | 0.5% |
| 161 | 1783 | 0.5% |
| 141 | 1777 | 0.5% |
| 154 | 1775 | 0.5% |
| Other values (356) | 344690 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 204382 | |
| 2 | 182397 | |
| 3 | 128618 | |
| 5 | 80887 | 8.0% |
| 4 | 77962 | 7.7% |
| 6 | 76302 | 7.6% |
| 0 | 67269 | 6.7% |
| 7 | 65487 | 6.5% |
| 9 | 63356 | 6.3% |
| 8 | 59653 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1006313 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 204382 | |
| 2 | 182397 | |
| 3 | 128618 | |
| 5 | 80887 | 8.0% |
| 4 | 77962 | 7.7% |
| 6 | 76302 | 7.6% |
| 0 | 67269 | 6.7% |
| 7 | 65487 | 6.5% |
| 9 | 63356 | 6.3% |
| 8 | 59653 | 5.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1006313 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 204382 | |
| 2 | 182397 | |
| 3 | 128618 | |
| 5 | 80887 | 8.0% |
| 4 | 77962 | 7.7% |
| 6 | 76302 | 7.6% |
| 0 | 67269 | 6.7% |
| 7 | 65487 | 6.5% |
| 9 | 63356 | 6.3% |
| 8 | 59653 | 5.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1006313 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 204382 | |
| 2 | 182397 | |
| 3 | 128618 | |
| 5 | 80887 | 8.0% |
| 4 | 77962 | 7.7% |
| 6 | 76302 | 7.6% |
| 0 | 67269 | 6.7% |
| 7 | 65487 | 6.5% |
| 9 | 63356 | 6.3% |
| 8 | 59653 | 5.9% |
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 91500 |
| Missing (%) | 20.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.768014253 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 84 |
|---|---|
| 2nd row | 151 |
| 3rd row | 130 |
| 4th row | 142 |
| 5th row | 41 |
| Value | Count | Frequency (%) |
| 126 | 2497 | 0.7% |
| 251 | 1977 | 0.5% |
| 120 | 1908 | 0.5% |
| 117 | 1873 | 0.5% |
| 161 | 1803 | 0.5% |
| 116 | 1783 | 0.5% |
| 141 | 1778 | 0.5% |
| 159 | 1755 | 0.5% |
| 146 | 1754 | 0.5% |
| 145 | 1748 | 0.5% |
| Other values (356) | 344836 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 204451 | |
| 2 | 182936 | |
| 3 | 128905 | |
| 5 | 81170 | 8.1% |
| 4 | 77196 | 7.7% |
| 6 | 75877 | 7.5% |
| 0 | 66802 | 6.6% |
| 7 | 65577 | 6.5% |
| 9 | 63938 | 6.4% |
| 8 | 59908 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1006760 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 204451 | |
| 2 | 182936 | |
| 3 | 128905 | |
| 5 | 81170 | 8.1% |
| 4 | 77196 | 7.7% |
| 6 | 75877 | 7.5% |
| 0 | 66802 | 6.6% |
| 7 | 65577 | 6.5% |
| 9 | 63938 | 6.4% |
| 8 | 59908 | 6.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1006760 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 204451 | |
| 2 | 182936 | |
| 3 | 128905 | |
| 5 | 81170 | 8.1% |
| 4 | 77196 | 7.7% |
| 6 | 75877 | 7.5% |
| 0 | 66802 | 6.6% |
| 7 | 65577 | 6.5% |
| 9 | 63938 | 6.4% |
| 8 | 59908 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1006760 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 204451 | |
| 2 | 182936 | |
| 3 | 128905 | |
| 5 | 81170 | 8.1% |
| 4 | 77196 | 7.7% |
| 6 | 75877 | 7.5% |
| 0 | 66802 | 6.6% |
| 7 | 65577 | 6.5% |
| 9 | 63938 | 6.4% |
| 8 | 59908 | 6.0% |
year
Text
Missing 
| Distinct | 191 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 60500 |
| Missing (%) | 13.3% |
| Memory size | 3.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1938 |
|---|---|
| 2nd row | 1956 |
| 3rd row | 1997 |
| 4th row | 1978 |
| 5th row | 1928 |
| Value | Count | Frequency (%) |
| 1909 | 17685 | 4.5% |
| 1908 | 13638 | 3.5% |
| 1970 | 11215 | 2.8% |
| 1969 | 10429 | 2.6% |
| 1964 | 9054 | 2.3% |
| 1978 | 8877 | 2.2% |
| 1967 | 8372 | 2.1% |
| 1979 | 7846 | 2.0% |
| 1971 | 7828 | 2.0% |
| 1968 | 7271 | 1.8% |
| Other values (181) | 292497 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 431944 | |
| 1 | 411052 | |
| 0 | 149894 | 9.5% |
| 8 | 131390 | 8.3% |
| 7 | 103219 | 6.5% |
| 6 | 100594 | 6.4% |
| 2 | 83679 | 5.3% |
| 5 | 59572 | 3.8% |
| 4 | 57248 | 3.6% |
| 3 | 50256 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1578848 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 431944 | |
| 1 | 411052 | |
| 0 | 149894 | 9.5% |
| 8 | 131390 | 8.3% |
| 7 | 103219 | 6.5% |
| 6 | 100594 | 6.4% |
| 2 | 83679 | 5.3% |
| 5 | 59572 | 3.8% |
| 4 | 57248 | 3.6% |
| 3 | 50256 | 3.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1578848 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 431944 | |
| 1 | 411052 | |
| 0 | 149894 | 9.5% |
| 8 | 131390 | 8.3% |
| 7 | 103219 | 6.5% |
| 6 | 100594 | 6.4% |
| 2 | 83679 | 5.3% |
| 5 | 59572 | 3.8% |
| 4 | 57248 | 3.6% |
| 3 | 50256 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1578848 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 431944 | |
| 1 | 411052 | |
| 0 | 149894 | 9.5% |
| 8 | 131390 | 8.3% |
| 7 | 103219 | 6.5% |
| 6 | 100594 | 6.4% |
| 2 | 83679 | 5.3% |
| 5 | 59572 | 3.8% |
| 4 | 57248 | 3.6% |
| 3 | 50256 | 3.2% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 82757 |
| Missing (%) | 18.2% |
| Memory size | 3.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.196200883 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 5 |
| 3rd row | 5 |
| 4th row | 5 |
| 5th row | 2 |
| Value | Count | Frequency (%) |
| 5 | 47294 | |
| 6 | 37778 | |
| 9 | 37734 | |
| 8 | 35582 | |
| 4 | 34250 | |
| 7 | 33265 | |
| 3 | 32600 | |
| 11 | 31410 | |
| 10 | 23798 | |
| 2 | 22626 | |
| Other values (2) | 36118 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 122736 | |
| 5 | 47294 | 10.6% |
| 2 | 40494 | 9.1% |
| 6 | 37778 | 8.5% |
| 9 | 37734 | 8.5% |
| 8 | 35582 | 8.0% |
| 4 | 34250 | 7.7% |
| 7 | 33265 | 7.5% |
| 3 | 32600 | 7.3% |
| 0 | 23798 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 445531 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 122736 | |
| 5 | 47294 | 10.6% |
| 2 | 40494 | 9.1% |
| 6 | 37778 | 8.5% |
| 9 | 37734 | 8.5% |
| 8 | 35582 | 8.0% |
| 4 | 34250 | 7.7% |
| 7 | 33265 | 7.5% |
| 3 | 32600 | 7.3% |
| 0 | 23798 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 445531 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 122736 | |
| 5 | 47294 | 10.6% |
| 2 | 40494 | 9.1% |
| 6 | 37778 | 8.5% |
| 9 | 37734 | 8.5% |
| 8 | 35582 | 8.0% |
| 4 | 34250 | 7.7% |
| 7 | 33265 | 7.5% |
| 3 | 32600 | 7.3% |
| 0 | 23798 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 445531 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 122736 | |
| 5 | 47294 | 10.6% |
| 2 | 40494 | 9.1% |
| 6 | 37778 | 8.5% |
| 9 | 37734 | 8.5% |
| 8 | 35582 | 8.0% |
| 4 | 34250 | 7.7% |
| 7 | 33265 | 7.5% |
| 3 | 32600 | 7.3% |
| 0 | 23798 | 5.3% |
day
Text
Missing 
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 108703 |
| Missing (%) | 23.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 96 |
|---|---|
| Median length | 2 |
| Mean length | 1.693840564 |
| Min length | 1 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 25 |
|---|---|
| 2nd row | 30 |
| 3rd row | 10 |
| 4th row | 22 |
| 5th row | 10 |
| Value | Count | Frequency (%) |
| 8 | 13276 | 3.8% |
| 15 | 12432 | 3.6% |
| 5 | 12265 | 3.5% |
| 23 | 12122 | 3.5% |
| 7 | 12104 | 3.5% |
| 6 | 12098 | 3.5% |
| 3 | 11850 | 3.4% |
| 14 | 11823 | 3.4% |
| 16 | 11810 | 3.4% |
| 11 | 11460 | 3.3% |
| Other values (35) | 225282 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 153101 | |
| 2 | 144165 | |
| 3 | 50450 | 8.6% |
| 5 | 35497 | 6.0% |
| 6 | 35148 | 6.0% |
| 8 | 35145 | 6.0% |
| 4 | 34647 | 5.9% |
| 7 | 34166 | 5.8% |
| 9 | 32496 | 5.5% |
| 0 | 32024 | 5.5% |
| Other values (30) | 92 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 586839 | |
| Lowercase Letter | 64 | < 0.1% |
| Space Separator | 13 | < 0.1% |
| Uppercase Letter | 9 | < 0.1% |
| Other Punctuation | 4 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11 | |
| o | 7 | |
| r | 7 | |
| a | 5 | |
| c | 4 | 6.2% |
| i | 4 | 6.2% |
| t | 4 | 6.2% |
| n | 4 | 6.2% |
| s | 3 | 4.7% |
| d | 3 | 4.7% |
| Other values (9) | 12 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 153101 | |
| 2 | 144165 | |
| 3 | 50450 | 8.6% |
| 5 | 35497 | 6.0% |
| 6 | 35148 | 6.0% |
| 8 | 35145 | 6.0% |
| 4 | 34647 | 5.9% |
| 7 | 34166 | 5.8% |
| 9 | 32496 | 5.5% |
| 0 | 32024 | 5.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 3 | |
| P | 2 | |
| B | 1 | 11.1% |
| C | 1 | 11.1% |
| W | 1 | 11.1% |
| E | 1 | 11.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 | |
| , | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| 13 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 586858 | |
| Latin | 73 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 11 | |
| o | 7 | 9.6% |
| r | 7 | 9.6% |
| a | 5 | 6.8% |
| c | 4 | 5.5% |
| i | 4 | 5.5% |
| t | 4 | 5.5% |
| n | 4 | 5.5% |
| s | 3 | 4.1% |
| d | 3 | 4.1% |
| Other values (15) | 21 |
Common
| Value | Count | Frequency (%) |
| 1 | 153101 | |
| 2 | 144165 | |
| 3 | 50450 | 8.6% |
| 5 | 35497 | 6.0% |
| 6 | 35148 | 6.0% |
| 8 | 35145 | 6.0% |
| 4 | 34647 | 5.9% |
| 7 | 34166 | 5.8% |
| 9 | 32496 | 5.5% |
| 0 | 32024 | 5.5% |
| Other values (5) | 19 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 586931 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 153101 | |
| 2 | 144165 | |
| 3 | 50450 | 8.6% |
| 5 | 35497 | 6.0% |
| 6 | 35148 | 6.0% |
| 8 | 35145 | 6.0% |
| 4 | 34647 | 5.9% |
| 7 | 34166 | 5.8% |
| 9 | 32496 | 5.5% |
| 0 | 32024 | 5.5% |
| Other values (30) | 92 | < 0.1% |
Missing 
| Distinct | 34047 |
|---|---|
| Distinct (%) | 9.4% |
| Missing | 92472 |
| Missing (%) | 20.3% |
| Memory size | 3.5 MiB |
Length
| Max length | 102 |
|---|---|
| Median length | 98 |
| Mean length | 26.64553123 |
| Min length | 2 |
Unique
| Unique | 10960 ? |
|---|---|
| Unique (%) | 3.0% |
Sample
| 1st row | 0000 00 00 - 0000 00 00 |
|---|---|
| 2nd row | 1938 Mar 25 - 0000 00 00 |
| 3rd row | 0000 00 00 - 0000 00 00 |
| 4th row | 1956 May 30 - 0000 00 00 |
| 5th row | 0000 00 00 - 0000 00 00 |
| Value | Count | Frequency (%) |
| 00 | 796795 | |
| 420492 | ||
| 0000 | 373796 | |
| may | 36088 | 1.3% |
| jun | 32030 | 1.2% |
| sep | 31699 | 1.2% |
| aug | 30346 | 1.1% |
| apr | 29080 | 1.1% |
| jul | 27007 | 1.0% |
| mar | 26144 | 1.0% |
| Other values (2486) | 893609 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3533769 | |
| 2334346 | ||
| 1 | 644236 | 6.7% |
| 9 | 432298 | 4.5% |
| - | 426014 | 4.4% |
| 2 | 223234 | 2.3% |
| : | 175106 | 1.8% |
| 8 | 153818 | 1.6% |
| 3 | 151315 | 1.6% |
| 5 | 144564 | 1.5% |
| Other values (61) | 1446700 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5678799 | |
| Space Separator | 2334346 | |
| Lowercase Letter | 643687 | 6.7% |
| Dash Punctuation | 426016 | 4.4% |
| Uppercase Letter | 314140 | 3.3% |
| Other Punctuation | 267700 | 2.8% |
| Open Punctuation | 326 | < 0.1% |
| Close Punctuation | 326 | < 0.1% |
| Math Symbol | 60 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 91605 | |
| a | 81376 | |
| e | 70054 | |
| p | 62172 | |
| r | 58263 | |
| n | 50110 | |
| y | 36475 | 5.7% |
| c | 34893 | 5.4% |
| g | 30933 | 4.8% |
| l | 28217 | 4.4% |
| Other values (14) | 99589 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 75515 | |
| M | 63514 | |
| A | 61266 | |
| S | 31961 | |
| N | 25248 | 8.0% |
| F | 19093 | 6.1% |
| O | 17673 | 5.6% |
| D | 16283 | 5.2% |
| H | 1232 | 0.4% |
| T | 729 | 0.2% |
| Other values (13) | 1626 | 0.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3533769 | |
| 1 | 644236 | 11.3% |
| 9 | 432298 | 7.6% |
| 2 | 223234 | 3.9% |
| 8 | 153818 | 2.7% |
| 3 | 151315 | 2.7% |
| 5 | 144564 | 2.5% |
| 6 | 135744 | 2.4% |
| 7 | 135695 | 2.4% |
| 4 | 124126 | 2.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 175106 | |
| ; | 89844 | |
| , | 1348 | 0.5% |
| . | 1124 | 0.4% |
| / | 273 | 0.1% |
| ? | 5 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 426014 | |
| – | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 325 | |
| [ | 1 | 0.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 325 | |
| ] | 1 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 2334346 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 60 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8707573 | |
| Latin | 957827 | 9.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 91605 | 9.6% |
| a | 81376 | 8.5% |
| J | 75515 | 7.9% |
| e | 70054 | 7.3% |
| M | 63514 | 6.6% |
| p | 62172 | 6.5% |
| A | 61266 | 6.4% |
| r | 58263 | 6.1% |
| n | 50110 | 5.2% |
| y | 36475 | 3.8% |
| Other values (37) | 307477 |
Common
| Value | Count | Frequency (%) |
| 0 | 3533769 | |
| 2334346 | ||
| 1 | 644236 | 7.4% |
| 9 | 432298 | 5.0% |
| - | 426014 | 4.9% |
| 2 | 223234 | 2.6% |
| : | 175106 | 2.0% |
| 8 | 153818 | 1.8% |
| 3 | 151315 | 1.7% |
| 5 | 144564 | 1.7% |
| Other values (14) | 488873 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9665398 | |
| Punctuation | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3533769 | |
| 2334346 | ||
| 1 | 644236 | 6.7% |
| 9 | 432298 | 4.5% |
| - | 426014 | 4.4% |
| 2 | 223234 | 2.3% |
| : | 175106 | 1.8% |
| 8 | 153818 | 1.6% |
| 3 | 151315 | 1.6% |
| 5 | 144564 | 1.5% |
| Other values (60) | 1446698 |
Punctuation
| Value | Count | Frequency (%) |
| – | 2 |
locationID
Text
Missing 
| Distinct | 16404 |
|---|---|
| Distinct (%) | 15.9% |
| Missing | 352012 |
| Missing (%) | 77.3% |
| Memory size | 3.5 MiB |
Length
| Max length | 68 |
|---|---|
| Median length | 40 |
| Mean length | 5.144757752 |
| Min length | 1 |
Unique
| Unique | 6089 ? |
|---|---|
| Unique (%) | 5.9% |
Sample
| 1st row | M10-97B4 (4 |
|---|---|
| 2nd row | 4-31N |
| 3rd row | 5627 |
| 4th row | 308 |
| 5th row | B12 TR4 |
| Value | Count | Frequency (%) |
| d | 13062 | 9.8% |
| tc | 3543 | 2.7% |
| haul | 1244 | 0.9% |
| trans | 1038 | 0.8% |
| 1 | 918 | 0.7% |
| 2 | 894 | 0.7% |
| tt | 799 | 0.6% |
| 4 | 661 | 0.5% |
| 3 | 655 | 0.5% |
| 5 | 629 | 0.5% |
| Other values (13796) | 109250 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 61871 | 11.7% |
| 2 | 49289 | 9.3% |
| - | 37677 | 7.1% |
| 3 | 36373 | 6.9% |
| 4 | 36203 | 6.8% |
| 5 | 34152 | 6.4% |
| 0 | 31687 | 6.0% |
| 29493 | 5.6% | |
| 7 | 29220 | 5.5% |
| 6 | 27484 | 5.2% |
| Other values (65) | 157490 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 348695 | |
| Uppercase Letter | 95277 | 17.9% |
| Dash Punctuation | 37677 | 7.1% |
| Space Separator | 29493 | 5.6% |
| Other Punctuation | 11083 | 2.1% |
| Lowercase Letter | 7547 | 1.4% |
| Open Punctuation | 805 | 0.2% |
| Close Punctuation | 350 | 0.1% |
| Math Symbol | 12 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 15184 | |
| A | 11061 | |
| T | 9956 | |
| C | 7283 | 7.6% |
| N | 6625 | 7.0% |
| S | 5565 | 5.8% |
| M | 4811 | 5.0% |
| B | 4690 | 4.9% |
| E | 4552 | 4.8% |
| P | 4397 | 4.6% |
| Other values (16) | 21153 |
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 1395 | |
| a | 1021 | |
| r | 854 | |
| i | 678 | |
| m | 582 | |
| q | 576 | |
| o | 499 | 6.6% |
| n | 332 | 4.4% |
| t | 315 | 4.2% |
| e | 236 | 3.1% |
| Other values (13) | 1059 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 61871 | |
| 2 | 49289 | |
| 3 | 36373 | |
| 4 | 36203 | |
| 5 | 34152 | |
| 0 | 31687 | |
| 7 | 29220 | |
| 6 | 27484 | |
| 8 | 21650 | 6.2% |
| 9 | 20766 | 6.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7463 | |
| ; | 1088 | 9.8% |
| / | 1054 | 9.5% |
| , | 808 | 7.3% |
| & | 269 | 2.4% |
| ? | 185 | 1.7% |
| # | 112 | 1.0% |
| : | 101 | 0.9% |
| ' | 2 | < 0.1% |
| " | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 9 | |
| = | 3 | 25.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 37677 |
Space Separator
| Value | Count | Frequency (%) |
| 29493 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 805 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 350 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 428115 | |
| Latin | 102824 | 19.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| D | 15184 | |
| A | 11061 | 10.8% |
| T | 9956 | 9.7% |
| C | 7283 | 7.1% |
| N | 6625 | 6.4% |
| S | 5565 | 5.4% |
| M | 4811 | 4.7% |
| B | 4690 | 4.6% |
| E | 4552 | 4.4% |
| P | 4397 | 4.3% |
| Other values (39) | 28700 |
Common
| Value | Count | Frequency (%) |
| 1 | 61871 | |
| 2 | 49289 | |
| - | 37677 | |
| 3 | 36373 | |
| 4 | 36203 | |
| 5 | 34152 | |
| 0 | 31687 | |
| 29493 | ||
| 7 | 29220 | |
| 6 | 27484 | |
| Other values (16) | 54666 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 530939 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 61871 | 11.7% |
| 2 | 49289 | 9.3% |
| - | 37677 | 7.1% |
| 3 | 36373 | 6.9% |
| 4 | 36203 | 6.8% |
| 5 | 34152 | 6.4% |
| 0 | 31687 | 6.0% |
| 29493 | 5.6% | |
| 7 | 29220 | 5.5% |
| 6 | 27484 | 5.2% |
| Other values (65) | 157490 |
higherGeography
Text
Missing 
| Distinct | 13756 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 20492 |
| Missing (%) | 4.5% |
| Memory size | 3.5 MiB |
Length
| Max length | 177 |
|---|---|
| Median length | 131 |
| Mean length | 59.33840633 |
| Min length | 4 |
Unique
| Unique | 3844 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | North Pacific Ocean, United States, Hawaii, Hawaiian Islands |
|---|---|
| 2nd row | North Atlantic Ocean, Gulf of Mexico, United States, Florida, Hillsborough County |
| 3rd row | North Pacific Ocean, Japan, Tokyo Prefecture, Japanese Archipelago, Honshu |
| 4th row | North America, United States, West Virginia, Randolph County |
| 5th row | Atlantic, Caribbean Sea, Barbados, Lesser Antilles, Barbados |
| Value | Count | Frequency (%) |
| ocean | 297628 | 8.7% |
| north | 281556 | 8.2% |
| pacific | 178084 | 5.2% |
| united | 125556 | 3.7% |
| states | 125313 | 3.7% |
| islands | 124596 | 3.6% |
| atlantic | 113798 | 3.3% |
| south | 106956 | 3.1% |
| america | 96813 | 2.8% |
| county | 72814 | 2.1% |
| Other values (6594) | 1896684 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2985078 | 11.6% | |
| a | 2677647 | 10.4% |
| i | 1842529 | 7.1% |
| n | 1669001 | 6.5% |
| e | 1623420 | 6.3% |
| t | 1398934 | 5.4% |
| , | 1342963 | 5.2% |
| o | 1188502 | 4.6% |
| c | 1128651 | 4.4% |
| r | 1085797 | 4.2% |
| Other values (113) | 8853070 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18063036 | |
| Uppercase Letter | 3373888 | 13.1% |
| Space Separator | 2985078 | 11.6% |
| Other Punctuation | 1353273 | 5.2% |
| Open Punctuation | 6855 | < 0.1% |
| Close Punctuation | 6855 | < 0.1% |
| Dash Punctuation | 6542 | < 0.1% |
| Format | 30 | < 0.1% |
| Decimal Number | 28 | < 0.1% |
| Modifier Letter | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2677647 | |
| i | 1842529 | |
| n | 1669001 | |
| e | 1623420 | |
| t | 1398934 | 7.7% |
| o | 1188502 | 6.6% |
| c | 1128651 | 6.2% |
| r | 1085797 | 6.0% |
| l | 929600 | 5.1% |
| s | 877851 | 4.9% |
| Other values (57) | 3641104 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 417855 | |
| P | 393221 | |
| A | 382224 | |
| N | 334378 | |
| O | 326042 | |
| C | 239987 | |
| I | 236820 | 7.0% |
| M | 157026 | 4.7% |
| B | 155806 | 4.6% |
| U | 132231 | 3.9% |
| Other values (26) | 598298 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1342963 | |
| ' | 5377 | 0.4% |
| . | 3635 | 0.3% |
| ; | 1110 | 0.1% |
| / | 108 | < 0.1% |
| : | 80 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 13 | |
| 0 | 7 | |
| 3 | 4 | 14.3% |
| 1 | 3 | 10.7% |
| 4 | 1 | 3.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6358 | |
| – | 184 | 2.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5690 | |
| [ | 1165 | 17.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5690 | |
| ] | 1165 | 17.0% |
Space Separator
| Value | Count | Frequency (%) |
| 2985078 |
Format
| Value | Count | Frequency (%) |
| | 30 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21436924 | |
| Common | 4358668 | 16.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2677647 | 12.5% |
| i | 1842529 | 8.6% |
| n | 1669001 | 7.8% |
| e | 1623420 | 7.6% |
| t | 1398934 | 6.5% |
| o | 1188502 | 5.5% |
| c | 1128651 | 5.3% |
| r | 1085797 | 5.1% |
| l | 929600 | 4.3% |
| s | 877851 | 4.1% |
| Other values (93) | 7014992 |
Common
| Value | Count | Frequency (%) |
| 2985078 | ||
| , | 1342963 | |
| - | 6358 | 0.1% |
| ( | 5690 | 0.1% |
| ) | 5690 | 0.1% |
| ' | 5377 | 0.1% |
| . | 3635 | 0.1% |
| ] | 1165 | < 0.1% |
| [ | 1165 | < 0.1% |
| ; | 1110 | < 0.1% |
| Other values (10) | 437 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25787588 | |
| None | 7726 | < 0.1% |
| Punctuation | 214 | < 0.1% |
| Latin Ext Additional | 57 | < 0.1% |
| Modifier Letters | 7 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2985078 | 11.6% | |
| a | 2677647 | 10.4% |
| i | 1842529 | 7.1% |
| n | 1669001 | 6.5% |
| e | 1623420 | 6.3% |
| t | 1398934 | 5.4% |
| , | 1342963 | 5.2% |
| o | 1188502 | 4.6% |
| c | 1128651 | 4.4% |
| r | 1085797 | 4.2% |
| Other values (59) | 8845066 |
None
| Value | Count | Frequency (%) |
| ó | 2695 | |
| á | 2628 | |
| í | 1137 | |
| é | 385 | 5.0% |
| ñ | 174 | 2.3% |
| ã | 109 | 1.4% |
| ú | 107 | 1.4% |
| Ō | 78 | 1.0% |
| Î | 59 | 0.8% |
| Ø | 51 | 0.7% |
| Other values (31) | 303 | 3.9% |
Punctuation
| Value | Count | Frequency (%) |
| – | 184 | |
| | 30 | 14.0% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ố | 15 | |
| ộ | 11 | |
| ả | 6 | 10.5% |
| ế | 6 | 10.5% |
| ừ | 5 | 8.8% |
| ổ | 4 | 7.0% |
| ị | 4 | 7.0% |
| ậ | 4 | 7.0% |
| ầ | 1 | 1.8% |
| ợ | 1 | 1.8% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 7 |
continent
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 162647 |
| Missing (%) | 35.7% |
| Memory size | 3.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 8.959157794 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | ASIA |
| 4th row | AFRICA |
| 5th row | OCEANIA |
| Value | Count | Frequency (%) |
| north_america | 101424 | |
| asia | 73673 | |
| oceania | 62827 | |
| south_america | 34099 | 11.7% |
| africa | 17795 | 6.1% |
| europe | 2346 | 0.8% |
| antarctica | 401 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 580839 | |
| I | 290219 | |
| R | 257489 | |
| C | 216947 | 8.3% |
| E | 203042 | 7.7% |
| O | 200696 | 7.7% |
| N | 164652 | 6.3% |
| T | 136325 | 5.2% |
| H | 135523 | 5.2% |
| _ | 135523 | 5.2% |
| Other values (5) | 299881 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2485613 | |
| Connector Punctuation | 135523 | 5.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 580839 | |
| I | 290219 | |
| R | 257489 | |
| C | 216947 | 8.7% |
| E | 203042 | 8.2% |
| O | 200696 | 8.1% |
| N | 164652 | 6.6% |
| T | 136325 | 5.5% |
| H | 135523 | 5.5% |
| M | 135523 | 5.5% |
| Other values (4) | 164358 | 6.6% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 135523 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2485613 | |
| Common | 135523 | 5.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 580839 | |
| I | 290219 | |
| R | 257489 | |
| C | 216947 | 8.7% |
| E | 203042 | 8.2% |
| O | 200696 | 8.1% |
| N | 164652 | 6.6% |
| T | 136325 | 5.5% |
| H | 135523 | 5.5% |
| M | 135523 | 5.5% |
| Other values (4) | 164358 | 6.6% |
Common
| Value | Count | Frequency (%) |
| _ | 135523 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2621136 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 580839 | |
| I | 290219 | |
| R | 257489 | |
| C | 216947 | 8.3% |
| E | 203042 | 7.7% |
| O | 200696 | 7.7% |
| N | 164652 | 6.3% |
| T | 136325 | 5.2% |
| H | 135523 | 5.2% |
| _ | 135523 | 5.2% |
| Other values (5) | 299881 |
waterBody
Text
Missing 
| Distinct | 1776 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 133275 |
| Missing (%) | 29.3% |
| Memory size | 3.5 MiB |
Length
| Max length | 72 |
|---|---|
| Median length | 71 |
| Mean length | 24.05968559 |
| Min length | 6 |
Unique
| Unique | 489 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | North Pacific Ocean |
|---|---|
| 2nd row | North Atlantic Ocean, Gulf of Mexico |
| 3rd row | North Pacific Ocean |
| 4th row | Atlantic, Caribbean Sea |
| 5th row | North Pacific Ocean |
| Value | Count | Frequency (%) |
| ocean | 296315 | |
| north | 200693 | |
| pacific | 178071 | |
| atlantic | 113701 | 9.4% |
| south | 68065 | 5.6% |
| sea | 63584 | 5.3% |
| of | 34822 | 2.9% |
| gulf | 34750 | 2.9% |
| bay | 30113 | 2.5% |
| indian | 28800 | 2.4% |
| Other values (1364) | 159134 |
Most occurring characters
| Value | Count | Frequency (%) |
| 886111 | ||
| a | 885823 | |
| c | 796020 | 10.3% |
| i | 598543 | 7.7% |
| n | 555364 | 7.2% |
| t | 522462 | 6.7% |
| e | 465166 | 6.0% |
| o | 359377 | 4.6% |
| O | 297236 | 3.8% |
| h | 288188 | 3.7% |
| Other values (58) | 2091413 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5546317 | |
| Uppercase Letter | 1172272 | 15.1% |
| Space Separator | 886111 | 11.4% |
| Other Punctuation | 139267 | 1.8% |
| Dash Punctuation | 1592 | < 0.1% |
| Open Punctuation | 72 | < 0.1% |
| Close Punctuation | 72 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 885823 | |
| c | 796020 | |
| i | 598543 | |
| n | 555364 | |
| t | 522462 | |
| e | 465166 | |
| o | 359377 | |
| h | 288188 | 5.2% |
| r | 272734 | 4.9% |
| f | 249485 | 4.5% |
| Other values (22) | 553155 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 297236 | |
| N | 202063 | |
| P | 187410 | |
| S | 148088 | |
| A | 121001 | |
| C | 45565 | 3.9% |
| B | 39829 | 3.4% |
| G | 37597 | 3.2% |
| M | 31550 | 2.7% |
| I | 31014 | 2.6% |
| Other values (16) | 30919 | 2.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 137355 | |
| ; | 1110 | 0.8% |
| ' | 517 | 0.4% |
| . | 150 | 0.1% |
| : | 80 | 0.1% |
| / | 55 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 886111 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1592 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 72 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 72 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6718589 | |
| Common | 1027114 | 13.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 885823 | |
| c | 796020 | |
| i | 598543 | 8.9% |
| n | 555364 | 8.3% |
| t | 522462 | 7.8% |
| e | 465166 | 6.9% |
| o | 359377 | 5.3% |
| O | 297236 | 4.4% |
| h | 288188 | 4.3% |
| r | 272734 | 4.1% |
| Other values (48) | 1677676 |
Common
| Value | Count | Frequency (%) |
| 886111 | ||
| , | 137355 | 13.4% |
| - | 1592 | 0.2% |
| ; | 1110 | 0.1% |
| ' | 517 | 0.1% |
| . | 150 | < 0.1% |
| : | 80 | < 0.1% |
| ( | 72 | < 0.1% |
| ) | 72 | < 0.1% |
| / | 55 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7745257 | |
| None | 446 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 886111 | ||
| a | 885823 | |
| c | 796020 | 10.3% |
| i | 598543 | 7.7% |
| n | 555364 | 7.2% |
| t | 522462 | 6.7% |
| e | 465166 | 6.0% |
| o | 359377 | 4.6% |
| O | 297236 | 3.8% |
| h | 288188 | 3.7% |
| Other values (51) | 2090967 |
None
| Value | Count | Frequency (%) |
| í | 171 | |
| á | 95 | |
| ñ | 68 | 15.2% |
| é | 59 | 13.2% |
| ó | 38 | 8.5% |
| è | 13 | 2.9% |
| É | 2 | 0.4% |
islandGroup
Text
Missing 
| Distinct | 323 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 390811 |
| Missing (%) | 85.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 32 |
| Mean length | 14.81478548 |
| Min length | 4 |
Unique
| Unique | 37 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Florida Islands |
|---|---|
| 2nd row | Vava'u Group |
| 3rd row | Visayas |
| 4th row | Cuyo Islands |
| 5th row | Ha'apai Group |
| Value | Count | Frequency (%) |
| islands | 31617 | |
| group | 13997 | 9.9% |
| chain | 5485 | 3.9% |
| visayas | 4942 | 3.5% |
| leeward | 4824 | 3.4% |
| ralik | 4613 | 3.3% |
| bahama | 2866 | 2.0% |
| island | 2805 | 2.0% |
| cruz | 2205 | 1.6% |
| santa | 2205 | 1.6% |
| Other values (354) | 66278 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 143525 | |
| s | 92363 | 9.7% |
| 77436 | 8.1% | |
| n | 71038 | 7.4% |
| l | 57390 | 6.0% |
| d | 51466 | 5.4% |
| r | 46473 | 4.9% |
| u | 38564 | 4.0% |
| o | 37605 | 3.9% |
| i | 37528 | 3.9% |
| Other values (54) | 300699 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 728655 | |
| Uppercase Letter | 142911 | 15.0% |
| Space Separator | 77436 | 8.1% |
| Open Punctuation | 1946 | 0.2% |
| Close Punctuation | 1946 | 0.2% |
| Other Punctuation | 1144 | 0.1% |
| Format | 30 | < 0.1% |
| Dash Punctuation | 19 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 143525 | |
| s | 92363 | |
| n | 71038 | |
| l | 57390 | 7.9% |
| d | 51466 | 7.1% |
| r | 46473 | 6.4% |
| u | 38564 | 5.3% |
| o | 37605 | 5.2% |
| i | 37528 | 5.2% |
| e | 33860 | 4.6% |
| Other values (20) | 118843 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 34888 | |
| C | 16633 | |
| G | 16301 | |
| B | 11324 | 7.9% |
| S | 9420 | 6.6% |
| L | 8923 | 6.2% |
| R | 8182 | 5.7% |
| V | 7087 | 5.0% |
| T | 5921 | 4.1% |
| A | 5185 | 3.6% |
| Other values (16) | 19047 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1764 | |
| [ | 182 | 9.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1764 | |
| ] | 182 | 9.4% |
Space Separator
| Value | Count | Frequency (%) |
| 77436 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1144 |
Format
| Value | Count | Frequency (%) |
| | 30 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 871566 | |
| Common | 82521 | 8.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 143525 | |
| s | 92363 | 10.6% |
| n | 71038 | 8.2% |
| l | 57390 | 6.6% |
| d | 51466 | 5.9% |
| r | 46473 | 5.3% |
| u | 38564 | 4.4% |
| o | 37605 | 4.3% |
| i | 37528 | 4.3% |
| I | 34888 | 4.0% |
| Other values (46) | 260726 |
Common
| Value | Count | Frequency (%) |
| 77436 | ||
| ( | 1764 | 2.1% |
| ) | 1764 | 2.1% |
| ' | 1144 | 1.4% |
| [ | 182 | 0.2% |
| ] | 182 | 0.2% |
| | 30 | < 0.1% |
| - | 19 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 953947 | |
| None | 110 | < 0.1% |
| Punctuation | 30 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 143525 | |
| s | 92363 | 9.7% |
| 77436 | 8.1% | |
| n | 71038 | 7.4% |
| l | 57390 | 6.0% |
| d | 51466 | 5.4% |
| r | 46473 | 4.9% |
| u | 38564 | 4.0% |
| o | 37605 | 3.9% |
| i | 37528 | 3.9% |
| Other values (48) | 300559 |
None
| Value | Count | Frequency (%) |
| Ō | 78 | |
| ñ | 18 | 16.4% |
| ù | 5 | 4.5% |
| à | 5 | 4.5% |
| á | 4 | 3.6% |
Punctuation
| Value | Count | Frequency (%) |
| | 30 |
island
Text
Missing 
| Distinct | 2224 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 270596 |
| Missing (%) | 59.4% |
| Memory size | 3.5 MiB |
Length
| Max length | 43 |
|---|---|
| Median length | 37 |
| Mean length | 9.782494475 |
| Min length | 3 |
Unique
| Unique | 463 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Honshu |
|---|---|
| 2nd row | Barbados |
| 3rd row | Putic Island |
| 4th row | Guam |
| 5th row | Florida Island |
| Value | Count | Frequency (%) |
| island | 45621 | 15.8% |
| bermuda | 14507 | 5.0% |
| atoll | 13109 | 4.5% |
| luzon | 7631 | 2.6% |
| oahu | 6792 | 2.3% |
| cay | 5201 | 1.8% |
| carrie | 3799 | 1.3% |
| bow | 3799 | 1.3% |
| new | 3013 | 1.0% |
| cuba | 2705 | 0.9% |
| Other values (2080) | 182972 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 268860 | |
| n | 133966 | 7.4% |
| o | 116663 | 6.5% |
| l | 110963 | 6.1% |
| 104533 | 5.8% | |
| u | 90794 | 5.0% |
| e | 88610 | 4.9% |
| i | 86645 | 4.8% |
| r | 86277 | 4.8% |
| d | 84321 | 4.7% |
| Other values (70) | 634373 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1403163 | |
| Uppercase Letter | 287699 | 15.9% |
| Space Separator | 104533 | 5.8% |
| Open Punctuation | 3752 | 0.2% |
| Close Punctuation | 3752 | 0.2% |
| Other Punctuation | 1868 | 0.1% |
| Dash Punctuation | 1229 | 0.1% |
| Decimal Number | 8 | < 0.1% |
| Modifier Letter | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 268860 | |
| n | 133966 | |
| o | 116663 | |
| l | 110963 | |
| u | 90794 | 6.5% |
| e | 88610 | 6.3% |
| i | 86645 | 6.2% |
| r | 86277 | 6.1% |
| d | 84321 | 6.0% |
| s | 83762 | 6.0% |
| Other values (30) | 252302 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 50631 | |
| B | 35471 | |
| C | 23025 | 8.0% |
| M | 22483 | 7.8% |
| A | 21385 | 7.4% |
| S | 16304 | 5.7% |
| T | 14185 | 4.9% |
| L | 13182 | 4.6% |
| O | 10784 | 3.7% |
| N | 10606 | 3.7% |
| Other values (17) | 69643 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1251 | |
| . | 551 | |
| / | 39 | 2.1% |
| , | 27 | 1.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2827 | |
| [ | 925 | 24.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2827 | |
| ] | 925 | 24.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 4 | |
| 0 | 4 |
Space Separator
| Value | Count | Frequency (%) |
| 104533 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1229 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1690862 | |
| Common | 115143 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 268860 | |
| n | 133966 | 7.9% |
| o | 116663 | 6.9% |
| l | 110963 | 6.6% |
| u | 90794 | 5.4% |
| e | 88610 | 5.2% |
| i | 86645 | 5.1% |
| r | 86277 | 5.1% |
| d | 84321 | 5.0% |
| s | 83762 | 5.0% |
| Other values (57) | 540001 |
Common
| Value | Count | Frequency (%) |
| 104533 | ||
| ( | 2827 | 2.5% |
| ) | 2827 | 2.5% |
| ' | 1251 | 1.1% |
| - | 1229 | 1.1% |
| [ | 925 | 0.8% |
| ] | 925 | 0.8% |
| . | 551 | 0.5% |
| / | 39 | < 0.1% |
| , | 27 | < 0.1% |
| Other values (3) | 9 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1805500 | |
| None | 488 | < 0.1% |
| Latin Ext Additional | 16 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 268860 | |
| n | 133966 | 7.4% |
| o | 116663 | 6.5% |
| l | 110963 | 6.1% |
| 104533 | 5.8% | |
| u | 90794 | 5.0% |
| e | 88610 | 4.9% |
| i | 86645 | 4.8% |
| r | 86277 | 4.8% |
| d | 84321 | 4.7% |
| Other values (53) | 633868 |
None
| Value | Count | Frequency (%) |
| ó | 101 | |
| é | 91 | |
| á | 85 | |
| ñ | 65 | |
| Î | 50 | |
| ú | 41 | |
| í | 17 | 3.5% |
| Á | 12 | 2.5% |
| â | 10 | 2.0% |
| ô | 7 | 1.4% |
| Other values (4) | 9 | 1.8% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ố | 15 | |
| ộ | 1 | 6.2% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 1 |
countryCode
Text
Missing 
| Distinct | 217 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 30434 |
| Missing (%) | 6.7% |
| Memory size | 3.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | JP |
| 4th row | US |
| 5th row | BB |
| Value | Count | Frequency (%) |
| us | 124564 | |
| ph | 46190 | 10.9% |
| bm | 15821 | 3.7% |
| id | 12805 | 3.0% |
| br | 11602 | 2.7% |
| pa | 10456 | 2.5% |
| pf | 9998 | 2.4% |
| pg | 7692 | 1.8% |
| jp | 7188 | 1.7% |
| au | 7086 | 1.7% |
| Other values (207) | 171376 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 147534 | |
| S | 146438 | |
| P | 91776 | |
| H | 58467 | 6.9% |
| B | 50362 | 5.9% |
| M | 50159 | 5.9% |
| C | 28118 | 3.3% |
| A | 26140 | 3.1% |
| I | 23175 | 2.7% |
| T | 22845 | 2.7% |
| Other values (16) | 204542 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 849556 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 147534 | |
| S | 146438 | |
| P | 91776 | |
| H | 58467 | 6.9% |
| B | 50362 | 5.9% |
| M | 50159 | 5.9% |
| C | 28118 | 3.3% |
| A | 26140 | 3.1% |
| I | 23175 | 2.7% |
| T | 22845 | 2.7% |
| Other values (16) | 204542 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 849556 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 147534 | |
| S | 146438 | |
| P | 91776 | |
| H | 58467 | 6.9% |
| B | 50362 | 5.9% |
| M | 50159 | 5.9% |
| C | 28118 | 3.3% |
| A | 26140 | 3.1% |
| I | 23175 | 2.7% |
| T | 22845 | 2.7% |
| Other values (16) | 204542 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 849556 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 147534 | |
| S | 146438 | |
| P | 91776 | |
| H | 58467 | 6.9% |
| B | 50362 | 5.9% |
| M | 50159 | 5.9% |
| C | 28118 | 3.3% |
| A | 26140 | 3.1% |
| I | 23175 | 2.7% |
| T | 22845 | 2.7% |
| Other values (16) | 204542 |
stateProvince
Text
Missing 
| Distinct | 1486 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 174301 |
| Missing (%) | 38.3% |
| Memory size | 3.5 MiB |
Length
| Max length | 48 |
|---|---|
| Median length | 36 |
| Mean length | 11.08342144 |
| Min length | 3 |
Unique
| Unique | 252 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Hawaii |
|---|---|
| 2nd row | Florida |
| 3rd row | Tokyo Prefecture |
| 4th row | West Virginia |
| 5th row | Palawan |
| Value | Count | Frequency (%) |
| province | 30907 | 7.1% |
| florida | 17201 | 4.0% |
| carolina | 12504 | 2.9% |
| virginia | 11495 | 2.7% |
| hawaii | 10718 | 2.5% |
| north | 9674 | 2.2% |
| region | 9360 | 2.2% |
| south | 8306 | 1.9% |
| maryland | 7690 | 1.8% |
| islands | 6712 | 1.6% |
| Other values (1479) | 308426 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 411155 | |
| i | 266017 | 8.5% |
| n | 234755 | 7.5% |
| o | 224195 | 7.2% |
| e | 216314 | 6.9% |
| r | 215084 | 6.9% |
| 152082 | 4.9% | |
| s | 136931 | 4.4% |
| t | 132525 | 4.3% |
| l | 123796 | 4.0% |
| Other values (87) | 1000601 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2526789 | |
| Uppercase Letter | 429904 | 13.8% |
| Space Separator | 152082 | 4.9% |
| Dash Punctuation | 2749 | 0.1% |
| Other Punctuation | 1809 | 0.1% |
| Open Punctuation | 58 | < 0.1% |
| Close Punctuation | 58 | < 0.1% |
| Decimal Number | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 411155 | |
| i | 266017 | |
| n | 234755 | |
| o | 224195 | |
| e | 216314 | |
| r | 215084 | |
| s | 136931 | 5.4% |
| t | 132525 | 5.2% |
| l | 123796 | 4.9% |
| u | 87847 | 3.5% |
| Other values (44) | 478170 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 57745 | |
| M | 38630 | 9.0% |
| S | 38477 | 9.0% |
| C | 36517 | 8.5% |
| N | 28113 | 6.5% |
| T | 23306 | 5.4% |
| A | 22914 | 5.3% |
| D | 18317 | 4.3% |
| F | 18140 | 4.2% |
| R | 16356 | 3.8% |
| Other values (22) | 131389 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 907 | |
| . | 898 | |
| , | 2 | 0.1% |
| / | 2 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2719 | |
| – | 30 | 1.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 1 | 3 |
Space Separator
| Value | Count | Frequency (%) |
| 152082 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 58 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 58 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2956693 | |
| Common | 156762 | 5.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 411155 | |
| i | 266017 | 9.0% |
| n | 234755 | 7.9% |
| o | 224195 | 7.6% |
| e | 216314 | 7.3% |
| r | 215084 | 7.3% |
| s | 136931 | 4.6% |
| t | 132525 | 4.5% |
| l | 123796 | 4.2% |
| u | 87847 | 3.0% |
| Other values (76) | 908074 |
Common
| Value | Count | Frequency (%) |
| 152082 | ||
| - | 2719 | 1.7% |
| ' | 907 | 0.6% |
| . | 898 | 0.6% |
| [ | 58 | < 0.1% |
| ] | 58 | < 0.1% |
| – | 30 | < 0.1% |
| 0 | 3 | < 0.1% |
| 1 | 3 | < 0.1% |
| , | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3108939 | |
| None | 4463 | 0.1% |
| Punctuation | 30 | < 0.1% |
| Latin Ext Additional | 23 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 411155 | |
| i | 266017 | 8.6% |
| n | 234755 | 7.6% |
| o | 224195 | 7.2% |
| e | 216314 | 7.0% |
| r | 215084 | 6.9% |
| 152082 | 4.9% | |
| s | 136931 | 4.4% |
| t | 132525 | 4.3% |
| l | 123796 | 4.0% |
| Other values (52) | 996085 |
None
| Value | Count | Frequency (%) |
| á | 1932 | |
| ó | 1584 | |
| í | 532 | 11.9% |
| é | 135 | 3.0% |
| ã | 109 | 2.4% |
| ê | 36 | 0.8% |
| Á | 27 | 0.6% |
| è | 20 | 0.4% |
| å | 11 | 0.2% |
| É | 10 | 0.2% |
| Other values (19) | 67 | 1.5% |
Punctuation
| Value | Count | Frequency (%) |
| – | 30 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ừ | 5 | |
| ế | 5 | |
| ả | 5 | |
| ị | 4 | |
| ậ | 4 |
county
Text
Missing 
| Distinct | 2317 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 357533 |
| Missing (%) | 78.5% |
| Memory size | 3.5 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 40 |
| Mean length | 14.85270119 |
| Min length | 3 |
Unique
| Unique | 418 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Hillsborough County |
|---|---|
| 2nd row | Randolph County |
| 3rd row | Thoothukudi District |
| 4th row | Calvert County |
| 5th row | New Hanover County |
| Value | Count | Frequency (%) |
| county | 71646 | |
| district | 9126 | 4.4% |
| honolulu | 5828 | 2.8% |
| monroe | 3066 | 1.5% |
| parish | 2197 | 1.1% |
| carteret | 1943 | 0.9% |
| borough | 1790 | 0.9% |
| san | 1543 | 0.8% |
| montgomery | 1350 | 0.7% |
| barnstable | 1256 | 0.6% |
| Other values (2386) | 105465 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 144007 | 9.9% |
| o | 142669 | 9.8% |
| t | 126035 | 8.7% |
| u | 110453 | 7.6% |
| 107531 | 7.4% | |
| a | 92081 | 6.3% |
| C | 86847 | 6.0% |
| y | 83560 | 5.8% |
| e | 75882 | 5.2% |
| r | 67041 | 4.6% |
| Other values (77) | 414691 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1134683 | |
| Uppercase Letter | 205688 | 14.2% |
| Space Separator | 107531 | 7.4% |
| Other Punctuation | 2008 | 0.1% |
| Dash Punctuation | 823 | 0.1% |
| Open Punctuation | 22 | < 0.1% |
| Close Punctuation | 22 | < 0.1% |
| Decimal Number | 14 | < 0.1% |
| Modifier Letter | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 144007 | |
| o | 142669 | |
| t | 126035 | |
| u | 110453 | |
| a | 92081 | |
| y | 83560 | |
| e | 75882 | |
| r | 67041 | 5.9% |
| i | 59185 | 5.2% |
| l | 50106 | 4.4% |
| Other values (37) | 183664 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 86847 | |
| M | 15547 | 7.6% |
| D | 12611 | 6.1% |
| H | 9959 | 4.8% |
| B | 9681 | 4.7% |
| P | 9349 | 4.5% |
| S | 8796 | 4.3% |
| A | 7470 | 3.6% |
| L | 6041 | 2.9% |
| W | 5570 | 2.7% |
| Other values (19) | 33817 | 16.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1297 | |
| . | 653 | |
| , | 58 | 2.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 669 | |
| – | 154 | 18.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 13 | |
| 4 | 1 | 7.1% |
Space Separator
| Value | Count | Frequency (%) |
| 107531 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 22 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 22 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1340371 | |
| Common | 110426 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 144007 | |
| o | 142669 | |
| t | 126035 | 9.4% |
| u | 110453 | 8.2% |
| a | 92081 | 6.9% |
| C | 86847 | 6.5% |
| y | 83560 | 6.2% |
| e | 75882 | 5.7% |
| r | 67041 | 5.0% |
| i | 59185 | 4.4% |
| Other values (66) | 352611 |
Common
| Value | Count | Frequency (%) |
| 107531 | ||
| ' | 1297 | 1.2% |
| - | 669 | 0.6% |
| . | 653 | 0.6% |
| – | 154 | 0.1% |
| , | 58 | 0.1% |
| ( | 22 | < 0.1% |
| ) | 22 | < 0.1% |
| 2 | 13 | < 0.1% |
| ʻ | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1448457 | |
| None | 2167 | 0.1% |
| Punctuation | 154 | < 0.1% |
| Latin Ext Additional | 13 | < 0.1% |
| Modifier Letters | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 144007 | 9.9% |
| o | 142669 | 9.8% |
| t | 126035 | 8.7% |
| u | 110453 | 7.6% |
| 107531 | 7.4% | |
| a | 92081 | 6.4% |
| C | 86847 | 6.0% |
| y | 83560 | 5.8% |
| e | 75882 | 5.2% |
| r | 67041 | 4.6% |
| Other values (51) | 412351 |
None
| Value | Count | Frequency (%) |
| ó | 969 | |
| á | 512 | |
| í | 396 | |
| é | 78 | 3.6% |
| ú | 66 | 3.0% |
| Ø | 51 | 2.4% |
| ü | 40 | 1.8% |
| ñ | 21 | 1.0% |
| ō | 15 | 0.7% |
| ū | 6 | 0.3% |
| Other values (10) | 13 | 0.6% |
Punctuation
| Value | Count | Frequency (%) |
| – | 154 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ộ | 10 | |
| ợ | 1 | 7.7% |
| ầ | 1 | 7.7% |
| ế | 1 | 7.7% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 6 |
locality
Text
Missing 
| Distinct | 63950 |
|---|---|
| Distinct (%) | 15.6% |
| Missing | 45084 |
| Missing (%) | 9.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 653 |
|---|---|
| Median length | 273 |
| Mean length | 54.14135587 |
| Min length | 1 |
Unique
| Unique | 31252 ? |
|---|---|
| Unique (%) | 7.6% |
Sample
| 1st row | Hawaii |
|---|---|
| 2nd row | Tampa, Florida |
| 3rd row | Tokyo, Japan |
| 4th row | West Virginia, Randolph County, Shaver's Fork at Cheat Bridge on US Route 250 (Durbin Quad) |
| 5th row | No Data |
| Value | Count | Frequency (%) |
| of | 176506 | 5.1% |
| island | 102327 | 3.0% |
| islands | 48966 | 1.4% |
| bay | 45484 | 1.3% |
| river | 43645 | 1.3% |
| reef | 43102 | 1.2% |
| off | 42172 | 1.2% |
| and | 41056 | 1.2% |
| at | 38682 | 1.1% |
| south | 38327 | 1.1% |
| Other values (37098) | 2831019 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3041158 | 13.7% | |
| a | 2147970 | 9.7% |
| e | 1545770 | 7.0% |
| o | 1444005 | 6.5% |
| n | 1302074 | 5.9% |
| i | 1144156 | 5.2% |
| t | 1048248 | 4.7% |
| r | 1047962 | 4.7% |
| s | 988467 | 4.5% |
| l | 837964 | 3.8% |
| Other values (103) | 7657112 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15649263 | |
| Space Separator | 3041158 | 13.7% |
| Uppercase Letter | 2205461 | 9.9% |
| Other Punctuation | 922659 | 4.2% |
| Decimal Number | 240591 | 1.1% |
| Open Punctuation | 53105 | 0.2% |
| Close Punctuation | 53070 | 0.2% |
| Dash Punctuation | 38007 | 0.2% |
| Math Symbol | 1469 | < 0.1% |
| Other Symbol | 56 | < 0.1% |
| Other values (7) | 47 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2147970 | |
| e | 1545770 | |
| o | 1444005 | |
| n | 1302074 | 8.3% |
| i | 1144156 | 7.3% |
| t | 1048248 | 6.7% |
| r | 1047962 | 6.7% |
| s | 988467 | 6.3% |
| l | 837964 | 5.4% |
| u | 586811 | 3.7% |
| Other values (29) | 3555836 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 210973 | 9.6% |
| C | 210043 | 9.5% |
| I | 192891 | 8.7% |
| B | 180023 | 8.2% |
| P | 172477 | 7.8% |
| M | 151758 | 6.9% |
| R | 133803 | 6.1% |
| N | 108616 | 4.9% |
| A | 99700 | 4.5% |
| T | 89914 | 4.1% |
| Other values (18) | 655263 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 682754 | |
| . | 154051 | 16.7% |
| ; | 33355 | 3.6% |
| : | 23885 | 2.6% |
| ' | 15305 | 1.7% |
| / | 6811 | 0.7% |
| " | 3864 | 0.4% |
| ? | 1335 | 0.1% |
| # | 479 | 0.1% |
| * | 470 | 0.1% |
| Other values (3) | 350 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 46824 | |
| 0 | 42199 | |
| 2 | 35959 | |
| 5 | 25415 | |
| 3 | 22663 | |
| 4 | 19158 | |
| 7 | 13345 | 5.5% |
| 6 | 13208 | 5.5% |
| 8 | 12180 | 5.1% |
| 9 | 9640 | 4.0% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1198 | |
| ~ | 135 | 9.2% |
| + | 132 | 9.0% |
| > | 4 | 0.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 48046 | |
| [ | 5050 | 9.5% |
| { | 9 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 48008 | |
| ] | 5056 | 9.5% |
| } | 6 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 38004 | |
| – | 3 | < 0.1% |
Final Punctuation
| Value | Count | Frequency (%) |
| › | 9 | |
| ” | 2 | 18.2% |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 2 | |
| ^ | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 3041158 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 56 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 25 |
Control
| Value | Count | Frequency (%) |
| | 4 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 2 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 1 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17854724 | |
| Common | 4350162 | 19.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2147970 | 12.0% |
| e | 1545770 | 8.7% |
| o | 1444005 | 8.1% |
| n | 1302074 | 7.3% |
| i | 1144156 | 6.4% |
| t | 1048248 | 5.9% |
| r | 1047962 | 5.9% |
| s | 988467 | 5.5% |
| l | 837964 | 4.7% |
| u | 586811 | 3.3% |
| Other values (57) | 5761297 |
Common
| Value | Count | Frequency (%) |
| 3041158 | ||
| , | 682754 | 15.7% |
| . | 154051 | 3.5% |
| ( | 48046 | 1.1% |
| ) | 48008 | 1.1% |
| 1 | 46824 | 1.1% |
| 0 | 42199 | 1.0% |
| - | 38004 | 0.9% |
| 2 | 35959 | 0.8% |
| ; | 33355 | 0.8% |
| Other values (36) | 179804 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22204428 | |
| None | 441 | < 0.1% |
| Punctuation | 16 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3041158 | 13.7% | |
| a | 2147970 | 9.7% |
| e | 1545770 | 7.0% |
| o | 1444005 | 6.5% |
| n | 1302074 | 5.9% |
| i | 1144156 | 5.2% |
| t | 1048248 | 4.7% |
| r | 1047962 | 4.7% |
| s | 988467 | 4.5% |
| l | 837964 | 3.8% |
| Other values (79) | 7656654 |
None
| Value | Count | Frequency (%) |
| á | 148 | |
| é | 71 | |
| ã | 67 | |
| ° | 56 | 12.7% |
| ø | 37 | 8.4% |
| à | 14 | 3.2% |
| ó | 14 | 3.2% |
| í | 10 | 2.3% |
| Ù | 5 | 1.1% |
| | 4 | 0.9% |
| Other values (9) | 15 | 3.4% |
Punctuation
| Value | Count | Frequency (%) |
| › | 9 | |
| – | 3 | 18.8% |
| “ | 2 | 12.5% |
| ” | 2 | 12.5% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 1 |
Missing 
| Distinct | 76 |
|---|---|
| Distinct (%) | 3.4% |
| Missing | 453008 |
| Missing (%) | 99.5% |
| Memory size | 3.5 MiB |
Length
| Max length | 152 |
|---|---|
| Median length | 68 |
| Mean length | 46.38838475 |
| Min length | 3 |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | Rotenone put out at 90' and 120', pickup was surface to 140', several (fiscos=factors?) prevented an even better collection. |
|---|---|
| 2nd row | Distance from shore: 1000 feet |
| 3rd row | 32 not found in field notes so could be inaccurate. |
| 4th row | Distance from shore: 1500 feet |
| 5th row | Naso was speared by P.W. (Paul D. West) |
| Value | Count | Frequency (%) |
| feet | 1680 | 9.0% |
| distance | 1141 | 6.1% |
| from | 1097 | 5.9% |
| to | 1064 | 5.7% |
| shore | 1048 | 5.6% |
| at | 595 | 3.2% |
| 499 | 2.7% | |
| and | 445 | 2.4% |
| rotenone | 430 | 2.3% |
| put | 309 | 1.6% |
| Other values (175) | 10428 |
Most occurring characters
| Value | Count | Frequency (%) |
| 16532 | ||
| e | 10266 | 10.0% |
| t | 7363 | 7.2% |
| o | 6811 | 6.7% |
| a | 5065 | 5.0% |
| s | 4792 | 4.7% |
| f | 4205 | 4.1% |
| n | 4144 | 4.1% |
| r | 4103 | 4.0% |
| 0 | 3353 | 3.3% |
| Other values (60) | 35606 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 67443 | |
| Space Separator | 16532 | 16.2% |
| Decimal Number | 7713 | 7.5% |
| Uppercase Letter | 5089 | 5.0% |
| Other Punctuation | 3746 | 3.7% |
| Dash Punctuation | 742 | 0.7% |
| Open Punctuation | 462 | 0.5% |
| Close Punctuation | 462 | 0.5% |
| Math Symbol | 51 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 10266 | |
| t | 7363 | |
| o | 6811 | |
| a | 5065 | 7.5% |
| s | 4792 | 7.1% |
| f | 4205 | 6.2% |
| n | 4144 | 6.1% |
| r | 4103 | 6.1% |
| i | 3239 | 4.8% |
| c | 2531 | 3.8% |
| Other values (15) | 14924 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1545 | |
| T | 692 | |
| P | 592 | 11.6% |
| W | 574 | 11.3% |
| A | 431 | 8.5% |
| R | 430 | 8.4% |
| C | 159 | 3.1% |
| N | 135 | 2.7% |
| G | 90 | 1.8% |
| V | 77 | 1.5% |
| Other values (10) | 364 | 7.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3353 | |
| 1 | 1286 | 16.7% |
| 5 | 905 | 11.7% |
| 2 | 800 | 10.4% |
| 7 | 383 | 5.0% |
| 6 | 218 | 2.8% |
| 8 | 215 | 2.8% |
| 3 | 210 | 2.7% |
| 4 | 199 | 2.6% |
| 9 | 144 | 1.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1561 | |
| : | 1444 | |
| , | 292 | 7.8% |
| ' | 270 | 7.2% |
| " | 97 | 2.6% |
| ? | 51 | 1.4% |
| ; | 30 | 0.8% |
| / | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 318 | |
| [ | 144 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 318 | |
| ] | 144 |
Space Separator
| Value | Count | Frequency (%) |
| 16532 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 742 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 51 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 72532 | |
| Common | 29708 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 10266 | |
| t | 7363 | 10.2% |
| o | 6811 | 9.4% |
| a | 5065 | 7.0% |
| s | 4792 | 6.6% |
| f | 4205 | 5.8% |
| n | 4144 | 5.7% |
| r | 4103 | 5.7% |
| i | 3239 | 4.5% |
| c | 2531 | 3.5% |
| Other values (35) | 20013 |
Common
| Value | Count | Frequency (%) |
| 16532 | ||
| 0 | 3353 | 11.3% |
| . | 1561 | 5.3% |
| : | 1444 | 4.9% |
| 1 | 1286 | 4.3% |
| 5 | 905 | 3.0% |
| 2 | 800 | 2.7% |
| - | 742 | 2.5% |
| 7 | 383 | 1.3% |
| ( | 318 | 1.1% |
| Other values (15) | 2384 | 8.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 102240 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 16532 | ||
| e | 10266 | 10.0% |
| t | 7363 | 7.2% |
| o | 6811 | 6.7% |
| a | 5065 | 5.0% |
| s | 4792 | 4.7% |
| f | 4205 | 4.1% |
| n | 4144 | 4.1% |
| r | 4103 | 4.0% |
| 0 | 3353 | 3.3% |
| Other values (60) | 35606 |
verbatimDepth
Text
Missing 
| Distinct | 230 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 446636 |
| Missing (%) | 98.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 72 |
|---|---|
| Median length | 67 |
| Mean length | 8.249766791 |
| Min length | 1 |
Unique
| Unique | 95 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | Depth trawl: 135 fathoms |
|---|---|
| 2nd row | 15 minutes at depth |
| 3rd row | Surface |
| 4th row | CA |
| 5th row | 15 minutes at depth |
| Value | Count | Frequency (%) |
| ca | 3930 | |
| surface | 2351 | |
| depth | 865 | 5.7% |
| at | 571 | 3.8% |
| 00000000 | 543 | 3.6% |
| to | 505 | 3.3% |
| minutes | 343 | 2.3% |
| fathoms | 330 | 2.2% |
| m | 320 | 2.1% |
| trawl | 287 | 1.9% |
| Other values (305) | 5114 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7671 | 10.8% |
| 6583 | 9.3% | |
| e | 5141 | 7.3% |
| a | 4475 | 6.3% |
| t | 4124 | 5.8% |
| A | 4103 | 5.8% |
| C | 3949 | 5.6% |
| r | 3468 | 4.9% |
| f | 3202 | 4.5% |
| u | 3133 | 4.4% |
| Other values (66) | 24901 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 39537 | |
| Uppercase Letter | 11262 | 15.9% |
| Decimal Number | 10956 | 15.5% |
| Space Separator | 6583 | 9.3% |
| Other Punctuation | 1980 | 2.8% |
| Dash Punctuation | 370 | 0.5% |
| Math Symbol | 32 | < 0.1% |
| Open Punctuation | 26 | < 0.1% |
| Close Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5141 | |
| a | 4475 | |
| t | 4124 | |
| r | 3468 | |
| f | 3202 | |
| u | 3133 | 7.9% |
| c | 2518 | 6.4% |
| o | 2218 | 5.6% |
| h | 1721 | 4.4% |
| s | 1672 | 4.2% |
| Other values (16) | 7865 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4103 | |
| C | 3949 | |
| S | 2270 | |
| D | 171 | 1.5% |
| O | 167 | 1.5% |
| T | 137 | 1.2% |
| M | 98 | 0.9% |
| H | 88 | 0.8% |
| I | 57 | 0.5% |
| B | 46 | 0.4% |
| Other values (14) | 176 | 1.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7671 | |
| 5 | 776 | 7.1% |
| 1 | 744 | 6.8% |
| 3 | 517 | 4.7% |
| 2 | 450 | 4.1% |
| 6 | 241 | 2.2% |
| 9 | 179 | 1.6% |
| 7 | 147 | 1.3% |
| 8 | 119 | 1.1% |
| 4 | 112 | 1.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 699 | |
| , | 459 | |
| ' | 405 | |
| : | 177 | 8.9% |
| ; | 128 | 6.5% |
| " | 106 | 5.4% |
| # | 4 | 0.2% |
| ? | 1 | 0.1% |
| * | 1 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 23 | |
| = | 8 | 25.0% |
| ~ | 1 | 3.1% |
Space Separator
| Value | Count | Frequency (%) |
| 6583 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 370 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 26 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 50799 | |
| Common | 19951 | 28.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5141 | 10.1% |
| a | 4475 | 8.8% |
| t | 4124 | 8.1% |
| A | 4103 | 8.1% |
| C | 3949 | 7.8% |
| r | 3468 | 6.8% |
| f | 3202 | 6.3% |
| u | 3133 | 6.2% |
| c | 2518 | 5.0% |
| S | 2270 | 4.5% |
| Other values (40) | 14416 |
Common
| Value | Count | Frequency (%) |
| 0 | 7671 | |
| 6583 | ||
| 5 | 776 | 3.9% |
| 1 | 744 | 3.7% |
| . | 699 | 3.5% |
| 3 | 517 | 2.6% |
| , | 459 | 2.3% |
| 2 | 450 | 2.3% |
| ' | 405 | 2.0% |
| - | 370 | 1.9% |
| Other values (16) | 1277 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 70750 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7671 | 10.8% |
| 6583 | 9.3% | |
| e | 5141 | 7.3% |
| a | 4475 | 6.3% |
| t | 4124 | 5.8% |
| A | 4103 | 5.8% |
| C | 3949 | 5.6% |
| r | 3468 | 4.9% |
| f | 3202 | 4.5% |
| u | 3133 | 4.4% |
| Other values (66) | 24901 |
minimumDistanceAboveSurfaceInMeters
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455211 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Williams, Jeffrey T. |
|---|
| Value | Count | Frequency (%) |
| williams | 1 | |
| jeffrey | 1 | |
| t | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 2 | 10.0% |
| l | 2 | 10.0% |
| 2 | 10.0% | |
| e | 2 | 10.0% |
| f | 2 | 10.0% |
| W | 1 | 5.0% |
| a | 1 | 5.0% |
| m | 1 | 5.0% |
| s | 1 | 5.0% |
| , | 1 | 5.0% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13 | |
| Uppercase Letter | 3 | 15.0% |
| Space Separator | 2 | 10.0% |
| Other Punctuation | 2 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2 | |
| l | 2 | |
| e | 2 | |
| f | 2 | |
| a | 1 | |
| m | 1 | |
| s | 1 | |
| r | 1 | |
| y | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 1 | |
| J | 1 | |
| T | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 | |
| . | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16 | |
| Common | 4 | 20.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 2 | |
| l | 2 | |
| e | 2 | |
| f | 2 | |
| W | 1 | |
| a | 1 | |
| m | 1 | |
| s | 1 | |
| J | 1 | |
| r | 1 | |
| Other values (2) | 2 |
Common
| Value | Count | Frequency (%) |
| 2 | ||
| , | 1 | |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 2 | 10.0% |
| l | 2 | 10.0% |
| 2 | 10.0% | |
| e | 2 | 10.0% |
| f | 2 | 10.0% |
| W | 1 | 5.0% |
| a | 1 | 5.0% |
| m | 1 | 5.0% |
| s | 1 | 5.0% |
| , | 1 | 5.0% |
| Other values (5) | 5 |
decimalLatitude
Text
Missing 
| Distinct | 15632 |
|---|---|
| Distinct (%) | 7.8% |
| Missing | 254257 |
| Missing (%) | 55.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 6.000413028 |
| Min length | 3 |
Unique
| Unique | 4363 ? |
|---|---|
| Unique (%) | 2.2% |
Sample
| 1st row | 13.2431 |
|---|---|
| 2nd row | 10.9181 |
| 3rd row | 31.93 |
| 4th row | 10.72 |
| 5th row | -2.0517 |
| Value | Count | Frequency (%) |
| 12.5 | 1211 | 0.6% |
| 27.9 | 868 | 0.4% |
| 16.8 | 711 | 0.4% |
| 12.0832 | 620 | 0.3% |
| 21.417 | 545 | 0.3% |
| 19.1606 | 541 | 0.3% |
| 32.23 | 510 | 0.3% |
| 32.17 | 503 | 0.3% |
| 32.3 | 491 | 0.2% |
| 28.4933 | 489 | 0.2% |
| Other values (14220) | 194466 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 200955 | |
| 3 | 140183 | |
| 1 | 139665 | |
| 2 | 127101 | |
| 8 | 92280 | |
| 7 | 91691 | |
| 5 | 84666 | |
| 4 | 71044 | 5.9% |
| 6 | 69792 | 5.8% |
| 9 | 69560 | 5.8% |
| Other values (2) | 118876 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 947134 | |
| Other Punctuation | 200955 | 16.7% |
| Dash Punctuation | 57724 | 4.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 140183 | |
| 1 | 139665 | |
| 2 | 127101 | |
| 8 | 92280 | |
| 7 | 91691 | |
| 5 | 84666 | |
| 4 | 71044 | |
| 6 | 69792 | |
| 9 | 69560 | |
| 0 | 61152 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 200955 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 57724 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1205813 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 200955 | |
| 3 | 140183 | |
| 1 | 139665 | |
| 2 | 127101 | |
| 8 | 92280 | |
| 7 | 91691 | |
| 5 | 84666 | |
| 4 | 71044 | 5.9% |
| 6 | 69792 | 5.8% |
| 9 | 69560 | 5.8% |
| Other values (2) | 118876 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1205813 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 200955 | |
| 3 | 140183 | |
| 1 | 139665 | |
| 2 | 127101 | |
| 8 | 92280 | |
| 7 | 91691 | |
| 5 | 84666 | |
| 4 | 71044 | 5.9% |
| 6 | 69792 | 5.8% |
| 9 | 69560 | 5.8% |
| Other values (2) | 118876 |
decimalLongitude
Text
Missing 
| Distinct | 17148 |
|---|---|
| Distinct (%) | 8.5% |
| Missing | 254257 |
| Missing (%) | 55.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 6.717578562 |
| Min length | 3 |
Unique
| Unique | 5089 ? |
|---|---|
| Unique (%) | 2.5% |
Sample
| 1st row | -59.6561 |
|---|---|
| 2nd row | 121.034 |
| 3rd row | -63.95 |
| 4th row | -67.88 |
| 5th row | 130.107 |
| Value | Count | Frequency (%) |
| 177.083 | 872 | 0.4% |
| 93.717 | 815 | 0.4% |
| 88.08 | 737 | 0.4% |
| 68.8991 | 618 | 0.3% |
| 64.0 | 564 | 0.3% |
| 158.417 | 546 | 0.3% |
| 179.756 | 541 | 0.3% |
| 162.875 | 490 | 0.2% |
| 165.83 | 469 | 0.2% |
| 84.9317 | 454 | 0.2% |
| Other values (16304) | 194849 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 200955 | |
| 1 | 164710 | |
| 7 | 129040 | |
| - | 125692 | |
| 8 | 119399 | |
| 3 | 100095 | |
| 6 | 99897 | |
| 2 | 97895 | |
| 5 | 93363 | |
| 4 | 79165 | 5.9% |
| Other values (2) | 139720 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1023284 | |
| Other Punctuation | 200955 | 14.9% |
| Dash Punctuation | 125692 | 9.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 164710 | |
| 7 | 129040 | |
| 8 | 119399 | |
| 3 | 100095 | |
| 6 | 99897 | |
| 2 | 97895 | |
| 5 | 93363 | |
| 4 | 79165 | |
| 9 | 76344 | |
| 0 | 63376 | 6.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 200955 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 125692 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1349931 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 200955 | |
| 1 | 164710 | |
| 7 | 129040 | |
| - | 125692 | |
| 8 | 119399 | |
| 3 | 100095 | |
| 6 | 99897 | |
| 2 | 97895 | |
| 5 | 93363 | |
| 4 | 79165 | 5.9% |
| Other values (2) | 139720 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1349931 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 200955 | |
| 1 | 164710 | |
| 7 | 129040 | |
| - | 125692 | |
| 8 | 119399 | |
| 3 | 100095 | |
| 6 | 99897 | |
| 2 | 97895 | |
| 5 | 93363 | |
| 4 | 79165 | 5.9% |
| Other values (2) | 139720 |
coordinateUncertaintyInMeters
Text
Missing 
| Distinct | 220 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 450059 |
| Missing (%) | 98.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 5.768872501 |
| Min length | 4 |
Unique
| Unique | 36 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 100.0 |
|---|---|
| 2nd row | 457.0 |
| 3rd row | 739.0 |
| 4th row | 100.0 |
| 5th row | 8438.0 |
| Value | Count | Frequency (%) |
| 100.0 | 1109 | |
| 10000.0 | 832 | 16.1% |
| 3704.0 | 209 | 4.1% |
| 500.0 | 188 | 3.6% |
| 5000.0 | 122 | 2.4% |
| 278076.0 | 107 | 2.1% |
| 441.0 | 89 | 1.7% |
| 330.0 | 83 | 1.6% |
| 50.0 | 78 | 1.5% |
| 3512.0 | 73 | 1.4% |
| Other values (210) | 2263 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 13130 | |
| . | 5153 | 17.3% |
| 1 | 3226 | 10.9% |
| 2 | 1526 | 5.1% |
| 4 | 1387 | 4.7% |
| 3 | 1195 | 4.0% |
| 5 | 1160 | 3.9% |
| 6 | 830 | 2.8% |
| 7 | 776 | 2.6% |
| 8 | 720 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 24574 | |
| Other Punctuation | 5153 | 17.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 13130 | |
| 1 | 3226 | 13.1% |
| 2 | 1526 | 6.2% |
| 4 | 1387 | 5.6% |
| 3 | 1195 | 4.9% |
| 5 | 1160 | 4.7% |
| 6 | 830 | 3.4% |
| 7 | 776 | 3.2% |
| 8 | 720 | 2.9% |
| 9 | 624 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5153 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 29727 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 13130 | |
| . | 5153 | 17.3% |
| 1 | 3226 | 10.9% |
| 2 | 1526 | 5.1% |
| 4 | 1387 | 4.7% |
| 3 | 1195 | 4.0% |
| 5 | 1160 | 3.9% |
| 6 | 830 | 2.8% |
| 7 | 776 | 2.6% |
| 8 | 720 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29727 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 13130 | |
| . | 5153 | 17.3% |
| 1 | 3226 | 10.9% |
| 2 | 1526 | 5.1% |
| 4 | 1387 | 4.7% |
| 3 | 1195 | 4.0% |
| 5 | 1160 | 3.9% |
| 6 | 830 | 2.8% |
| 7 | 776 | 2.6% |
| 8 | 720 | 2.4% |
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2341036 |
|---|---|
| 2nd row | 2384468 |
| 3rd row | 2353475 |
| 4th row | 2373066 |
| 5th row | 2414948 |
| Value | Count | Frequency (%) |
| 2341036 | 1 | |
| 2384468 | 1 | |
| 2353475 | 1 | |
| 2373066 | 1 | |
| 2414948 | 1 | |
| 2393782 | 1 | |
| 2335095 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 6 | 4 | 8.2% |
| 8 | 4 | 8.2% |
| 5 | 4 | 8.2% |
| 0 | 3 | 6.1% |
| 7 | 3 | 6.1% |
| 9 | 3 | 6.1% |
| 1 | 2 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 49 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 6 | 4 | 8.2% |
| 8 | 4 | 8.2% |
| 5 | 4 | 8.2% |
| 0 | 3 | 6.1% |
| 7 | 3 | 6.1% |
| 9 | 3 | 6.1% |
| 1 | 2 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 49 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 6 | 4 | 8.2% |
| 8 | 4 | 8.2% |
| 5 | 4 | 8.2% |
| 0 | 3 | 6.1% |
| 7 | 3 | 6.1% |
| 9 | 3 | 6.1% |
| 1 | 2 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 6 | 4 | 8.2% |
| 8 | 4 | 8.2% |
| 5 | 4 | 8.2% |
| 0 | 3 | 6.1% |
| 7 | 3 | 6.1% |
| 9 | 3 | 6.1% |
| 1 | 2 | 4.1% |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 308939 |
| Missing (%) | 67.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 22.92758746 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|---|
| 2nd row | Degrees Minutes Seconds |
| 3rd row | Degrees Minutes Seconds |
| 4th row | Degrees Minutes Seconds |
| 5th row | Degrees Minutes Seconds |
| Value | Count | Frequency (%) |
| degrees | 146265 | |
| minutes | 144957 | |
| seconds | 144957 | |
| decimal | 1308 | 0.3% |
| unknown | 8 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 730017 | |
| s | 436179 | |
| 291222 | 8.7% | |
| n | 289938 | 8.6% |
| D | 146265 | 4.4% |
| c | 146265 | 4.4% |
| g | 146265 | 4.4% |
| r | 146265 | 4.4% |
| d | 146265 | 4.4% |
| i | 146265 | 4.4% |
| Other values (11) | 728741 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2626278 | |
| Uppercase Letter | 436187 | 13.0% |
| Space Separator | 291222 | 8.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 730017 | |
| s | 436179 | |
| n | 289938 | 11.0% |
| c | 146265 | 5.6% |
| g | 146265 | 5.6% |
| r | 146265 | 5.6% |
| d | 146265 | 5.6% |
| i | 146265 | 5.6% |
| o | 144965 | 5.5% |
| t | 144957 | 5.5% |
| Other values (6) | 148897 | 5.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 146265 | |
| S | 144957 | |
| M | 144957 | |
| U | 8 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 291222 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3062465 | |
| Common | 291222 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 730017 | |
| s | 436179 | |
| n | 289938 | 9.5% |
| D | 146265 | 4.8% |
| c | 146265 | 4.8% |
| g | 146265 | 4.8% |
| r | 146265 | 4.8% |
| d | 146265 | 4.8% |
| i | 146265 | 4.8% |
| o | 144965 | 4.7% |
| Other values (10) | 583776 |
Common
| Value | Count | Frequency (%) |
| 291222 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3353687 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 730017 | |
| s | 436179 | |
| 291222 | 8.7% | |
| n | 289938 | 8.6% |
| D | 146265 | 4.4% |
| c | 146265 | 4.4% |
| g | 146265 | 4.4% |
| r | 146265 | 4.4% |
| d | 146265 | 4.4% |
| i | 146265 | 4.4% |
| Other values (11) | 728741 |
georeferencedBy
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 36 |
| Mean length | 35.85714286 |
| Min length | 33 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Noturus nocturnus Jordan & Gilbert, 1886 |
|---|---|
| 2nd row | Thalassoma lunare (Linnaeus, 1758) |
| 3rd row | Brycon falcatus Müller & Troschel, 1844 |
| 4th row | Pseudotropheus elongatus Fryer, 1956 |
| 5th row | Halieutaea brevicauda Ogilby, 1910 |
| Value | Count | Frequency (%) |
| 2 | 6.2% | |
| noturus | 1 | 3.1% |
| elongatus | 1 | 3.1% |
| pallas | 1 | 3.1% |
| cirrhosus | 1 | 3.1% |
| blepsias | 1 | 3.1% |
| 1840 | 1 | 3.1% |
| valenciennes | 1 | 3.1% |
| globiceps | 1 | 3.1% |
| scarus | 1 | 3.1% |
| Other values (21) | 21 |
Most occurring characters
| Value | Count | Frequency (%) |
| 25 | 10.0% | |
| a | 19 | 7.6% |
| s | 18 | 7.2% |
| e | 17 | 6.8% |
| r | 15 | 6.0% |
| l | 15 | 6.0% |
| u | 14 | 5.6% |
| n | 11 | 4.4% |
| o | 11 | 4.4% |
| c | 9 | 3.6% |
| Other values (37) | 97 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 169 | |
| Decimal Number | 28 | 11.2% |
| Space Separator | 25 | 10.0% |
| Uppercase Letter | 16 | 6.4% |
| Other Punctuation | 9 | 3.6% |
| Close Punctuation | 2 | 0.8% |
| Open Punctuation | 2 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 19 | |
| s | 18 | |
| e | 17 | |
| r | 15 | |
| l | 15 | |
| u | 14 | |
| n | 11 | 6.5% |
| o | 11 | 6.5% |
| c | 9 | 5.3% |
| i | 9 | 5.3% |
| Other values (11) | 31 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 2 | |
| P | 2 | |
| T | 2 | |
| O | 1 | 6.2% |
| F | 1 | 6.2% |
| S | 1 | 6.2% |
| H | 1 | 6.2% |
| N | 1 | 6.2% |
| M | 1 | 6.2% |
| L | 1 | 6.2% |
| Other values (3) | 3 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9 | |
| 8 | 6 | |
| 4 | 4 | |
| 9 | 2 | 7.1% |
| 5 | 2 | 7.1% |
| 6 | 2 | 7.1% |
| 0 | 2 | 7.1% |
| 7 | 1 | 3.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 7 | |
| & | 2 | 22.2% |
Space Separator
| Value | Count | Frequency (%) |
| 25 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 185 | |
| Common | 66 | 26.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 19 | |
| s | 18 | 9.7% |
| e | 17 | 9.2% |
| r | 15 | 8.1% |
| l | 15 | 8.1% |
| u | 14 | 7.6% |
| n | 11 | 5.9% |
| o | 11 | 5.9% |
| c | 9 | 4.9% |
| i | 9 | 4.9% |
| Other values (24) | 47 |
Common
| Value | Count | Frequency (%) |
| 25 | ||
| 1 | 9 | 13.6% |
| , | 7 | 10.6% |
| 8 | 6 | 9.1% |
| 4 | 4 | 6.1% |
| 9 | 2 | 3.0% |
| ) | 2 | 3.0% |
| 5 | 2 | 3.0% |
| ( | 2 | 3.0% |
| 6 | 2 | 3.0% |
| Other values (3) | 5 | 7.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 250 | |
| None | 1 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 25 | 10.0% | |
| a | 19 | 7.6% |
| s | 18 | 7.2% |
| e | 17 | 6.8% |
| r | 15 | 6.0% |
| l | 15 | 6.0% |
| u | 14 | 5.6% |
| n | 11 | 4.4% |
| o | 11 | 4.4% |
| c | 9 | 3.6% |
| Other values (36) | 96 |
None
| Value | Count | Frequency (%) |
| ü | 1 |
Missing 
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 437832 |
| Missing (%) | 96.2% |
| Memory size | 3.5 MiB |
Length
| Max length | 125 |
|---|---|
| Median length | 96 |
| Mean length | 19.25863061 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | GPS |
|---|---|
| 2nd row | On-line Gazetteer |
| 3rd row | Differential GPS |
| 4th row | Guide to Best Practices for Georeferencing. (Chapman and Wieczorek, eds. 2006). Google Earth Pro |
| 5th row | Chart |
| Value | Count | Frequency (%) |
| chart | 6339 | 11.9% |
| gps | 6318 | 11.9% |
| 3627 | 6.8% | |
| earth | 3256 | 6.1% |
| georeferencing | 2448 | 4.6% |
| and | 2426 | 4.6% |
| pro | 2399 | 4.5% |
| 2006 | 2399 | 4.5% |
| wieczorek | 2399 | 4.5% |
| eds | 2399 | 4.5% |
| Other values (37) | 19229 |
Most occurring characters
| Value | Count | Frequency (%) |
| 35859 | 10.7% | |
| e | 34001 | 10.2% |
| r | 26699 | 8.0% |
| a | 21993 | 6.6% |
| t | 20226 | 6.0% |
| o | 19852 | 5.9% |
| G | 16342 | 4.9% |
| n | 13437 | 4.0% |
| h | 12548 | 3.7% |
| i | 12159 | 3.6% |
| Other values (51) | 121599 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 217719 | |
| Uppercase Letter | 53904 | 16.1% |
| Space Separator | 35859 | 10.7% |
| Other Punctuation | 10427 | 3.1% |
| Decimal Number | 10339 | 3.1% |
| Open Punctuation | 2475 | 0.7% |
| Close Punctuation | 2475 | 0.7% |
| Dash Punctuation | 1235 | 0.4% |
| Math Symbol | 282 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 34001 | |
| r | 26699 | |
| a | 21993 | |
| t | 20226 | |
| o | 19852 | |
| n | 13437 | 6.2% |
| h | 12548 | 5.8% |
| i | 12159 | 5.6% |
| c | 10049 | 4.6% |
| s | 7688 | 3.5% |
| Other values (15) | 39067 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 16342 | |
| P | 11268 | |
| C | 8787 | |
| S | 6378 | 11.8% |
| E | 3538 | 6.6% |
| W | 2399 | 4.5% |
| B | 2399 | 4.5% |
| O | 1228 | 2.3% |
| M | 344 | 0.6% |
| R | 331 | 0.6% |
| Other values (9) | 890 | 1.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5063 | |
| 2 | 2573 | |
| 6 | 2399 | |
| 1 | 179 | 1.7% |
| 3 | 49 | 0.5% |
| 5 | 49 | 0.5% |
| 4 | 27 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7420 | |
| , | 2502 | 24.0% |
| ; | 206 | 2.0% |
| / | 201 | 1.9% |
| : | 98 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| 35859 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2475 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2475 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1235 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 282 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 271623 | |
| Common | 63092 | 18.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 34001 | |
| r | 26699 | 9.8% |
| a | 21993 | 8.1% |
| t | 20226 | 7.4% |
| o | 19852 | 7.3% |
| G | 16342 | 6.0% |
| n | 13437 | 4.9% |
| h | 12548 | 4.6% |
| i | 12159 | 4.5% |
| P | 11268 | 4.1% |
| Other values (34) | 83098 |
Common
| Value | Count | Frequency (%) |
| 35859 | ||
| . | 7420 | 11.8% |
| 0 | 5063 | 8.0% |
| 2 | 2573 | 4.1% |
| , | 2502 | 4.0% |
| ( | 2475 | 3.9% |
| ) | 2475 | 3.9% |
| 6 | 2399 | 3.8% |
| - | 1235 | 2.0% |
| + | 282 | 0.4% |
| Other values (7) | 809 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 334715 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 35859 | 10.7% | |
| e | 34001 | 10.2% |
| r | 26699 | 8.0% |
| a | 21993 | 6.6% |
| t | 20226 | 6.0% |
| o | 19852 | 5.9% |
| G | 16342 | 4.9% |
| n | 13437 | 4.0% |
| h | 12548 | 3.7% |
| i | 12159 | 3.6% |
| Other values (51) | 121599 |
Missing 
| Distinct | 135 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 432197 |
| Missing (%) | 94.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 158 |
|---|---|
| Median length | 2 |
| Mean length | 7.226026504 |
| Min length | 1 |
Unique
| Unique | 64 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Start; End |
|---|---|
| 2nd row | ca |
| 3rd row | CA |
| 4th row | CA |
| 5th row | CA |
| Value | Count | Frequency (%) |
| ca | 18410 | |
| start | 2530 | 6.4% |
| end | 2436 | 6.1% |
| bank | 1768 | 4.4% |
| flower | 1768 | 4.4% |
| garden | 1768 | 4.4% |
| for | 977 | 2.5% |
| west | 940 | 2.4% |
| east | 828 | 2.1% |
| coordinates | 580 | 1.5% |
| Other values (263) | 7789 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 17102 | 10.3% |
| 16779 | 10.1% | |
| A | 16547 | 9.9% |
| a | 12099 | 7.3% |
| t | 11571 | 7.0% |
| n | 10340 | 6.2% |
| e | 9905 | 6.0% |
| r | 8710 | 5.2% |
| o | 7884 | 4.7% |
| d | 6230 | 3.7% |
| Other values (50) | 49140 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 93445 | |
| Uppercase Letter | 47863 | |
| Space Separator | 16779 | 10.1% |
| Other Punctuation | 6140 | 3.7% |
| Decimal Number | 2011 | 1.2% |
| Dash Punctuation | 65 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12099 | |
| t | 11571 | |
| n | 10340 | |
| e | 9905 | |
| r | 8710 | |
| o | 7884 | |
| d | 6230 | |
| l | 5770 | |
| i | 4482 | 4.8% |
| c | 4310 | 4.6% |
| Other values (13) | 12144 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 17102 | |
| A | 16547 | |
| E | 3265 | 6.8% |
| S | 2817 | 5.9% |
| G | 2363 | 4.9% |
| B | 1987 | 4.2% |
| F | 1797 | 3.8% |
| W | 1100 | 2.3% |
| O | 216 | 0.5% |
| T | 165 | 0.3% |
| Other values (7) | 504 | 1.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 400 | |
| 3 | 335 | |
| 6 | 216 | |
| 9 | 198 | |
| 8 | 196 | |
| 4 | 177 | |
| 2 | 164 | |
| 5 | 128 | 6.4% |
| 0 | 123 | 6.1% |
| 7 | 74 | 3.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 4208 | |
| . | 1334 | 21.7% |
| , | 592 | 9.6% |
| " | 4 | 0.1% |
| / | 1 | < 0.1% |
| ? | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 16779 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 65 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 141308 | |
| Common | 24999 | 15.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 17102 | |
| A | 16547 | |
| a | 12099 | 8.6% |
| t | 11571 | 8.2% |
| n | 10340 | 7.3% |
| e | 9905 | 7.0% |
| r | 8710 | 6.2% |
| o | 7884 | 5.6% |
| d | 6230 | 4.4% |
| l | 5770 | 4.1% |
| Other values (30) | 35150 |
Common
| Value | Count | Frequency (%) |
| 16779 | ||
| ; | 4208 | 16.8% |
| . | 1334 | 5.3% |
| , | 592 | 2.4% |
| 1 | 400 | 1.6% |
| 3 | 335 | 1.3% |
| 6 | 216 | 0.9% |
| 9 | 198 | 0.8% |
| 8 | 196 | 0.8% |
| 4 | 177 | 0.7% |
| Other values (10) | 564 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 166307 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 17102 | 10.3% |
| 16779 | 10.1% | |
| A | 16547 | 9.9% |
| a | 12099 | 7.3% |
| t | 11571 | 7.0% |
| n | 10340 | 6.2% |
| e | 9905 | 6.0% |
| r | 8710 | 5.2% |
| o | 7884 | 4.7% |
| d | 6230 | 3.7% |
| Other values (50) | 49140 |
latestEonOrHighestEonothem
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 141 |
|---|---|
| Median length | 134 |
| Mean length | 126.7142857 |
| Min length | 114 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Animalia, Chordata, Vertebrata, Osteichthyes, Actinopterygii, Neopterygii, Ostariophysi, Siluriformes, Ictaluridae |
|---|---|
| 2nd row | Animalia, Chordata, Vertebrata, Osteichthyes, Actinopterygii, Neopterygii, Acanthopterygii, Perciformes, Labroidei, Labridae |
| 3rd row | Animalia, Chordata, Vertebrata, Osteichthyes, Actinopterygii, Neopterygii, Ostariophysi, Characiformes, Characidae |
| 4th row | Animalia, Chordata, Vertebrata, Osteichthyes, Actinopterygii, Neopterygii, Acanthopterygii, Perciformes, Labroidei, Cichlidae |
| 5th row | Animalia, Chordata, Vertebrata, Osteichthyes, Actinopterygii, Neopterygii, Paracanthopterygii, Lophiiformes, Ogcocephalioidei, Ogcocephalidae |
| Value | Count | Frequency (%) |
| animalia | 7 | |
| vertebrata | 7 | |
| osteichthyes | 7 | |
| actinopterygii | 7 | |
| neopterygii | 7 | |
| chordata | 7 | |
| acanthopterygii | 4 | 5.8% |
| perciformes | 3 | 4.3% |
| labroidei | 3 | 4.3% |
| ostariophysi | 2 | 2.9% |
| Other values (15) | 15 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 101 | 11.4% |
| e | 82 | 9.2% |
| a | 73 | 8.2% |
| t | 73 | 8.2% |
| r | 66 | 7.4% |
| , | 62 | 7.0% |
| 62 | 7.0% | |
| o | 45 | 5.1% |
| h | 34 | 3.8% |
| c | 33 | 3.7% |
| Other values (21) | 256 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 694 | |
| Uppercase Letter | 69 | 7.8% |
| Other Punctuation | 62 | 7.0% |
| Space Separator | 62 | 7.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 101 | |
| e | 82 | |
| a | 73 | |
| t | 73 | |
| r | 66 | |
| o | 45 | 6.5% |
| h | 34 | 4.9% |
| c | 33 | 4.8% |
| y | 28 | 4.0% |
| p | 26 | 3.7% |
| Other values (9) | 133 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 18 | |
| O | 11 | |
| C | 11 | |
| V | 7 | 10.1% |
| N | 7 | 10.1% |
| L | 5 | 7.2% |
| S | 4 | 5.8% |
| P | 4 | 5.8% |
| I | 1 | 1.4% |
| H | 1 | 1.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 62 |
Space Separator
| Value | Count | Frequency (%) |
| 62 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 763 | |
| Common | 124 | 14.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 101 | |
| e | 82 | |
| a | 73 | 9.6% |
| t | 73 | 9.6% |
| r | 66 | 8.7% |
| o | 45 | 5.9% |
| h | 34 | 4.5% |
| c | 33 | 4.3% |
| y | 28 | 3.7% |
| p | 26 | 3.4% |
| Other values (19) | 202 |
Common
| Value | Count | Frequency (%) |
| , | 62 | |
| 62 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 887 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 101 | 11.4% |
| e | 82 | 9.2% |
| a | 73 | 8.2% |
| t | 73 | 8.2% |
| r | 66 | 7.4% |
| , | 62 | 7.0% |
| 62 | 7.0% | |
| o | 45 | 5.1% |
| h | 34 | 3.8% |
| c | 33 | 3.7% |
| Other values (21) | 256 |
earliestEraOrLowestErathem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 7 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 14 | |
| a | 14 | |
| A | 7 | |
| n | 7 | |
| m | 7 | |
| l | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 49 | |
| Uppercase Letter | 7 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 14 | |
| a | 14 | |
| n | 7 | |
| m | 7 | |
| l | 7 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 56 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 14 | |
| a | 14 | |
| A | 7 | |
| n | 7 | |
| m | 7 | |
| l | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 14 | |
| a | 14 | |
| A | 7 | |
| n | 7 | |
| m | 7 | |
| l | 7 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Chordata |
|---|---|
| 2nd row | Chordata |
| 3rd row | Chordata |
| 4th row | Chordata |
| 5th row | Chordata |
| Value | Count | Frequency (%) |
| chordata | 7 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 14 | |
| C | 7 | |
| h | 7 | |
| o | 7 | |
| r | 7 | |
| d | 7 | |
| t | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 49 | |
| Uppercase Letter | 7 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 14 | |
| h | 7 | |
| o | 7 | |
| r | 7 | |
| d | 7 | |
| t | 7 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 56 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 14 | |
| C | 7 | |
| h | 7 | |
| o | 7 | |
| r | 7 | |
| d | 7 | |
| t | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 14 | |
| C | 7 | |
| h | 7 | |
| o | 7 | |
| r | 7 | |
| d | 7 | |
| t | 7 |
latestPeriodOrHighestSystem
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 71.4% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 13 |
| Mean length | 12.14285714 |
| Min length | 11 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 57.1% |
Sample
| 1st row | Siluriformes |
|---|---|
| 2nd row | Perciformes |
| 3rd row | Characiformes |
| 4th row | Perciformes |
| 5th row | Lophiiformes |
| Value | Count | Frequency (%) |
| perciformes | 3 | |
| siluriformes | 1 | 14.3% |
| characiformes | 1 | 14.3% |
| lophiiformes | 1 | 14.3% |
| scorpaeniformes | 1 | 14.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 13 | |
| e | 11 | |
| i | 9 | |
| o | 9 | |
| f | 7 | |
| m | 7 | |
| s | 7 | |
| c | 5 | 5.9% |
| P | 3 | 3.5% |
| a | 3 | 3.5% |
| Other values (8) | 11 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 78 | |
| Uppercase Letter | 7 | 8.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 13 | |
| e | 11 | |
| i | 9 | |
| o | 9 | |
| f | 7 | |
| m | 7 | |
| s | 7 | |
| c | 5 | 6.4% |
| a | 3 | 3.8% |
| h | 2 | 2.6% |
| Other values (4) | 5 | 6.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 3 | |
| S | 2 | |
| C | 1 | 14.3% |
| L | 1 | 14.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 85 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 13 | |
| e | 11 | |
| i | 9 | |
| o | 9 | |
| f | 7 | |
| m | 7 | |
| s | 7 | |
| c | 5 | 5.9% |
| P | 3 | 3.5% |
| a | 3 | 3.5% |
| Other values (8) | 11 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 85 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 13 | |
| e | 11 | |
| i | 9 | |
| o | 9 | |
| f | 7 | |
| m | 7 | |
| s | 7 | |
| c | 5 | 5.9% |
| P | 3 | 3.5% |
| a | 3 | 3.5% |
| Other values (8) | 11 |
latestEpochOrHighestSeries
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 11 |
| Mean length | 10.71428571 |
| Min length | 8 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Ictaluridae |
|---|---|
| 2nd row | Labridae |
| 3rd row | Bryconidae |
| 4th row | Cichlidae |
| 5th row | Ogcocephalidae |
| Value | Count | Frequency (%) |
| ictaluridae | 1 | |
| labridae | 1 | |
| bryconidae | 1 | |
| cichlidae | 1 | |
| ogcocephalidae | 1 | |
| scaridae | 1 | |
| hemitripteridae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 11 | |
| i | 10 | |
| e | 10 | |
| d | 7 | |
| r | 6 | 8.0% |
| c | 6 | 8.0% |
| t | 3 | 4.0% |
| l | 3 | 4.0% |
| h | 2 | 2.7% |
| p | 2 | 2.7% |
| Other values (14) | 15 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 68 | |
| Uppercase Letter | 7 | 9.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 11 | |
| i | 10 | |
| e | 10 | |
| d | 7 | |
| r | 6 | |
| c | 6 | |
| t | 3 | 4.4% |
| l | 3 | 4.4% |
| h | 2 | 2.9% |
| p | 2 | 2.9% |
| Other values (7) | 8 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1 | |
| H | 1 | |
| S | 1 | |
| O | 1 | |
| B | 1 | |
| C | 1 | |
| L | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 75 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 11 | |
| i | 10 | |
| e | 10 | |
| d | 7 | |
| r | 6 | 8.0% |
| c | 6 | 8.0% |
| t | 3 | 4.0% |
| l | 3 | 4.0% |
| h | 2 | 2.7% |
| p | 2 | 2.7% |
| Other values (14) | 15 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 75 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 11 | |
| i | 10 | |
| e | 10 | |
| d | 7 | |
| r | 6 | 8.0% |
| c | 6 | 8.0% |
| t | 3 | 4.0% |
| l | 3 | 4.0% |
| h | 2 | 2.7% |
| p | 2 | 2.7% |
| Other values (14) | 15 |
highestBiostratigraphicZone
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 10 |
| Mean length | 8.714285714 |
| Min length | 6 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Noturus |
|---|---|
| 2nd row | Thalassoma |
| 3rd row | Brycon |
| 4th row | Pseudotropheus |
| 5th row | Halieutaea |
| Value | Count | Frequency (%) |
| noturus | 1 | |
| thalassoma | 1 | |
| brycon | 1 | |
| pseudotropheus | 1 | |
| halieutaea | 1 | |
| scarus | 1 | |
| blepsias | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 8 | |
| a | 8 | |
| u | 6 | 9.8% |
| e | 5 | 8.2% |
| o | 5 | 8.2% |
| r | 4 | 6.6% |
| t | 3 | 4.9% |
| l | 3 | 4.9% |
| B | 2 | 3.3% |
| i | 2 | 3.3% |
| Other values (12) | 15 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 54 | |
| Uppercase Letter | 7 | 11.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 8 | |
| a | 8 | |
| u | 6 | |
| e | 5 | |
| o | 5 | |
| r | 4 | |
| t | 3 | 5.6% |
| l | 3 | 5.6% |
| i | 2 | 3.7% |
| h | 2 | 3.7% |
| Other values (6) | 8 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 2 | |
| H | 1 | |
| N | 1 | |
| P | 1 | |
| T | 1 | |
| S | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 61 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 8 | |
| a | 8 | |
| u | 6 | 9.8% |
| e | 5 | 8.2% |
| o | 5 | 8.2% |
| r | 4 | 6.6% |
| t | 3 | 4.9% |
| l | 3 | 4.9% |
| B | 2 | 3.3% |
| i | 2 | 3.3% |
| Other values (12) | 15 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 61 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 8 | |
| a | 8 | |
| u | 6 | 9.8% |
| e | 5 | 8.2% |
| o | 5 | 8.2% |
| r | 4 | 6.6% |
| t | 3 | 4.9% |
| l | 3 | 4.9% |
| B | 2 | 3.3% |
| i | 2 | 3.3% |
| Other values (12) | 15 |
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 10 |
| Mean length | 8.714285714 |
| Min length | 6 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Noturus |
|---|---|
| 2nd row | Thalassoma |
| 3rd row | Brycon |
| 4th row | Pseudotropheus |
| 5th row | Halieutaea |
| Value | Count | Frequency (%) |
| noturus | 1 | |
| thalassoma | 1 | |
| brycon | 1 | |
| pseudotropheus | 1 | |
| halieutaea | 1 | |
| scarus | 1 | |
| blepsias | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 8 | |
| a | 8 | |
| u | 6 | 9.8% |
| e | 5 | 8.2% |
| o | 5 | 8.2% |
| r | 4 | 6.6% |
| t | 3 | 4.9% |
| l | 3 | 4.9% |
| B | 2 | 3.3% |
| i | 2 | 3.3% |
| Other values (12) | 15 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 54 | |
| Uppercase Letter | 7 | 11.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 8 | |
| a | 8 | |
| u | 6 | |
| e | 5 | |
| o | 5 | |
| r | 4 | |
| t | 3 | 5.6% |
| l | 3 | 5.6% |
| i | 2 | 3.7% |
| h | 2 | 3.7% |
| Other values (6) | 8 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 2 | |
| H | 1 | |
| N | 1 | |
| P | 1 | |
| T | 1 | |
| S | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 61 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 8 | |
| a | 8 | |
| u | 6 | 9.8% |
| e | 5 | 8.2% |
| o | 5 | 8.2% |
| r | 4 | 6.6% |
| t | 3 | 4.9% |
| l | 3 | 4.9% |
| B | 2 | 3.3% |
| i | 2 | 3.3% |
| Other values (12) | 15 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 61 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 8 | |
| a | 8 | |
| u | 6 | 9.8% |
| e | 5 | 8.2% |
| o | 5 | 8.2% |
| r | 4 | 6.6% |
| t | 3 | 4.9% |
| l | 3 | 4.9% |
| B | 2 | 3.3% |
| i | 2 | 3.3% |
| Other values (12) | 15 |
member
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.571428571 |
| Min length | 6 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | nocturnus |
|---|---|
| 2nd row | lunare |
| 3rd row | falcatus |
| 4th row | elongatus |
| 5th row | brevicauda |
| Value | Count | Frequency (%) |
| nocturnus | 1 | |
| lunare | 1 | |
| falcatus | 1 | |
| elongatus | 1 | |
| brevicauda | 1 | |
| globiceps | 1 | |
| cirrhosus | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 7 | |
| a | 6 | |
| s | 6 | |
| c | 5 | |
| r | 5 | |
| n | 4 | 6.7% |
| o | 4 | 6.7% |
| e | 4 | 6.7% |
| l | 4 | 6.7% |
| t | 3 | 5.0% |
| Other values (8) | 12 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 60 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 7 | |
| a | 6 | |
| s | 6 | |
| c | 5 | |
| r | 5 | |
| n | 4 | 6.7% |
| o | 4 | 6.7% |
| e | 4 | 6.7% |
| l | 4 | 6.7% |
| t | 3 | 5.0% |
| Other values (8) | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 60 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 7 | |
| a | 6 | |
| s | 6 | |
| c | 5 | |
| r | 5 | |
| n | 4 | 6.7% |
| o | 4 | 6.7% |
| e | 4 | 6.7% |
| l | 4 | 6.7% |
| t | 3 | 5.0% |
| Other values (8) | 12 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 60 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 7 | |
| a | 6 | |
| s | 6 | |
| c | 5 | |
| r | 5 | |
| n | 4 | 6.7% |
| o | 4 | 6.7% |
| e | 4 | 6.7% |
| l | 4 | 6.7% |
| t | 3 | 5.0% |
| Other values (8) | 12 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SPECIES |
|---|---|
| 2nd row | SPECIES |
| 3rd row | SPECIES |
| 4th row | SPECIES |
| 5th row | SPECIES |
| Value | Count | Frequency (%) |
| species | 7 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 14 | |
| E | 14 | |
| P | 7 | |
| C | 7 | |
| I | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 49 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 14 | |
| E | 14 | |
| P | 7 | |
| C | 7 | |
| I | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 49 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 14 | |
| E | 14 | |
| P | 7 | |
| C | 7 | |
| I | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 14 | |
| E | 14 | |
| P | 7 | |
| C | 7 | |
| I | 7 |
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 453516 |
| Missing (%) | 99.6% |
| Memory size | 3.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 3 |
| Mean length | 5.780660377 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | cf. |
|---|---|
| 2nd row | uncertain |
| 3rd row | uncertain |
| 4th row | uncertain |
| 5th row | near |
| Value | Count | Frequency (%) |
| cf | 895 | |
| uncertain | 783 | |
| aff | 14 | 0.8% |
| near | 4 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 1678 | |
| n | 1570 | |
| f | 923 | |
| . | 909 | |
| a | 801 | |
| e | 787 | |
| r | 787 | |
| t | 783 | |
| i | 783 | |
| u | 652 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8764 | |
| Other Punctuation | 909 | 9.3% |
| Uppercase Letter | 131 | 1.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 1678 | |
| n | 1570 | |
| f | 923 | |
| a | 801 | |
| e | 787 | |
| r | 787 | |
| t | 783 | |
| i | 783 | |
| u | 652 | 7.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 909 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 131 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8895 | |
| Common | 909 | 9.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 1678 | |
| n | 1570 | |
| f | 923 | |
| a | 801 | |
| e | 787 | |
| r | 787 | |
| t | 783 | |
| i | 783 | |
| u | 652 | 7.3% |
| U | 131 | 1.5% |
Common
| Value | Count | Frequency (%) |
| . | 909 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9804 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 1678 | |
| n | 1570 | |
| f | 923 | |
| . | 909 | |
| a | 801 | |
| e | 787 | |
| r | 787 | |
| t | 783 | |
| i | 783 | |
| u | 652 | 6.7% |
typeStatus
Text
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 436448 |
| Missing (%) | 95.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 7.670219569 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PARATYPE |
|---|---|
| 2nd row | HOLOTYPE |
| 3rd row | PARATYPE |
| 4th row | PARATYPE |
| 5th row | COTYPE |
| Value | Count | Frequency (%) |
| paratype | 12437 | |
| holotype | 3339 | 17.8% |
| type | 1470 | 7.8% |
| syntype | 819 | 4.4% |
| cotype | 296 | 1.6% |
| paralectotype | 207 | 1.1% |
| lectotype | 127 | 0.7% |
| neotype | 59 | 0.3% |
| allotype | 10 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 31408 | |
| A | 25298 | |
| Y | 19583 | |
| E | 19157 | |
| T | 19098 | |
| R | 12644 | |
| O | 7377 | 5.1% |
| L | 3693 | 2.6% |
| H | 3339 | 2.3% |
| N | 878 | 0.6% |
| Other values (2) | 1449 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 143924 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 31408 | |
| A | 25298 | |
| Y | 19583 | |
| E | 19157 | |
| T | 19098 | |
| R | 12644 | |
| O | 7377 | 5.1% |
| L | 3693 | 2.6% |
| H | 3339 | 2.3% |
| N | 878 | 0.6% |
| Other values (2) | 1449 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 143924 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 31408 | |
| A | 25298 | |
| Y | 19583 | |
| E | 19157 | |
| T | 19098 | |
| R | 12644 | |
| O | 7377 | 5.1% |
| L | 3693 | 2.6% |
| H | 3339 | 2.3% |
| N | 878 | 0.6% |
| Other values (2) | 1449 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 143924 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 31408 | |
| A | 25298 | |
| Y | 19583 | |
| E | 19157 | |
| T | 19098 | |
| R | 12644 | |
| O | 7377 | 5.1% |
| L | 3693 | 2.6% |
| H | 3339 | 2.3% |
| N | 878 | 0.6% |
| Other values (2) | 1449 | 1.0% |
identifiedBy
Text
Missing 
| Distinct | 572 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 421073 |
| Missing (%) | 92.5% |
| Memory size | 3.5 MiB |
Length
| Max length | 147 |
|---|---|
| Median length | 137 |
| Mean length | 21.13904918 |
| Min length | 5 |
Unique
| Unique | 143 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Pezold, Frank; Larson, Helen K. |
|---|---|
| 2nd row | Williams, Jeffrey T. |
| 3rd row | Williams, Jeffrey T. |
| 4th row | Eschmeyer, William N. |
| 5th row | Karnella, Susan J. |
| Value | Count | Frequency (%) |
| williams | 6495 | 5.8% |
| jeffrey | 6367 | 5.7% |
| t | 6366 | 5.7% |
| e | 4376 | 3.9% |
| david | 4213 | 3.8% |
| g | 4044 | 3.6% |
| smith | 3785 | 3.4% |
| c | 2656 | 2.4% |
| pitassy | 2526 | 2.3% |
| diane | 2526 | 2.3% |
| Other values (967) | 68435 |
Most occurring characters
| Value | Count | Frequency (%) |
| 77650 | 10.8% | |
| a | 55232 | 7.7% |
| i | 54043 | 7.5% |
| e | 50214 | 7.0% |
| , | 37806 | 5.2% |
| r | 34027 | 4.7% |
| l | 32102 | 4.4% |
| n | 30823 | 4.3% |
| . | 26916 | 3.7% |
| t | 26854 | 3.7% |
| Other values (59) | 295999 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 443204 | |
| Uppercase Letter | 127586 | 17.7% |
| Space Separator | 77650 | 10.8% |
| Other Punctuation | 66171 | 9.2% |
| Dash Punctuation | 2357 | 0.3% |
| Close Punctuation | 2272 | 0.3% |
| Open Punctuation | 2272 | 0.3% |
| Final Punctuation | 77 | < 0.1% |
| Initial Punctuation | 77 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 55232 | |
| i | 54043 | |
| e | 50214 | |
| r | 34027 | 7.7% |
| l | 32102 | 7.2% |
| n | 30823 | 7.0% |
| t | 26854 | 6.1% |
| s | 22669 | 5.1% |
| o | 22269 | 5.0% |
| m | 19827 | 4.5% |
| Other values (21) | 95144 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 11485 | 9.0% |
| D | 10253 | 8.0% |
| J | 9551 | 7.5% |
| S | 9476 | 7.4% |
| W | 9319 | 7.3% |
| C | 8939 | 7.0% |
| E | 7478 | 5.9% |
| A | 6920 | 5.4% |
| H | 6036 | 4.7% |
| G | 5572 | 4.4% |
| Other values (16) | 42557 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 37806 | |
| . | 26916 | |
| ; | 1042 | 1.6% |
| / | 387 | 0.6% |
| ' | 18 | < 0.1% |
| & | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 77650 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2357 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2272 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2272 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 77 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 77 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 570790 | |
| Common | 150876 | 20.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 55232 | 9.7% |
| i | 54043 | 9.5% |
| e | 50214 | 8.8% |
| r | 34027 | 6.0% |
| l | 32102 | 5.6% |
| n | 30823 | 5.4% |
| t | 26854 | 4.7% |
| s | 22669 | 4.0% |
| o | 22269 | 3.9% |
| m | 19827 | 3.5% |
| Other values (47) | 222730 |
Common
| Value | Count | Frequency (%) |
| 77650 | ||
| , | 37806 | |
| . | 26916 | 17.8% |
| - | 2357 | 1.6% |
| ) | 2272 | 1.5% |
| ( | 2272 | 1.5% |
| ; | 1042 | 0.7% |
| / | 387 | 0.3% |
| ” | 77 | 0.1% |
| “ | 77 | 0.1% |
| Other values (2) | 20 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 721467 | |
| Punctuation | 154 | < 0.1% |
| None | 45 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 77650 | 10.8% | |
| a | 55232 | 7.7% |
| i | 54043 | 7.5% |
| e | 50214 | 7.0% |
| , | 37806 | 5.2% |
| r | 34027 | 4.7% |
| l | 32102 | 4.4% |
| n | 30823 | 4.3% |
| . | 26916 | 3.7% |
| t | 26854 | 3.7% |
| Other values (51) | 295800 |
Punctuation
| Value | Count | Frequency (%) |
| ” | 77 | |
| “ | 77 |
None
| Value | Count | Frequency (%) |
| ñ | 31 | |
| á | 6 | 13.3% |
| Ö | 2 | 4.4% |
| ü | 2 | 4.4% |
| í | 2 | 4.4% |
| ê | 2 | 4.4% |
identifiedByID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | ACCEPTED |
| 3rd row | ACCEPTED |
| 4th row | ACCEPTED |
| 5th row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 7 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 14 | |
| E | 14 | |
| A | 7 | |
| P | 7 | |
| T | 7 | |
| D | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 56 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 14 | |
| E | 14 | |
| A | 7 | |
| P | 7 | |
| T | 7 | |
| D | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 56 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 14 | |
| E | 14 | |
| A | 7 | |
| P | 7 | |
| T | 7 | |
| D | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 14 | |
| E | 14 | |
| A | 7 | |
| P | 7 | |
| T | 7 | |
| D | 7 |
identificationVerificationStatus
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
|---|---|
| 2nd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 3rd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 4th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 5th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| Value | Count | Frequency (%) |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 7 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 28 | |
| a | 28 | |
| - | 28 | |
| 2 | 21 | |
| b | 21 | |
| 4 | 21 | |
| 8 | 14 | 5.6% |
| 3 | 14 | 5.6% |
| 5 | 14 | 5.6% |
| 9 | 14 | 5.6% |
| Other values (6) | 49 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 126 | |
| Lowercase Letter | 98 | |
| Dash Punctuation | 28 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 21 | |
| 4 | 21 | |
| 8 | 14 | |
| 3 | 14 | |
| 5 | 14 | |
| 9 | 14 | |
| 1 | 7 | 5.6% |
| 7 | 7 | 5.6% |
| 0 | 7 | 5.6% |
| 6 | 7 | 5.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 28 | |
| a | 28 | |
| b | 21 | |
| d | 14 | |
| e | 7 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 28 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 154 | |
| Latin | 98 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 28 | |
| 2 | 21 | |
| 4 | 21 | |
| 8 | 14 | |
| 3 | 14 | |
| 5 | 14 | |
| 9 | 14 | |
| 1 | 7 | 4.5% |
| 7 | 7 | 4.5% |
| 0 | 7 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| c | 28 | |
| a | 28 | |
| b | 21 | |
| d | 14 | |
| e | 7 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 252 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 28 | |
| a | 28 | |
| - | 28 | |
| 2 | 21 | |
| b | 21 | |
| 4 | 21 | |
| 8 | 14 | 5.6% |
| 3 | 14 | 5.6% |
| 5 | 14 | 5.6% |
| 9 | 14 | 5.6% |
| Other values (6) | 49 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 7 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 7 | |
| S | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 14 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 7 | |
| S | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 7 | |
| S | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 7 | |
| S | 7 |
taxonID
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2024-12-02T13:57:35.184Z |
|---|---|
| 2nd row | 2024-12-02T13:58:31.286Z |
| 3rd row | 2024-12-02T13:56:38.525Z |
| 4th row | 2024-12-02T13:59:43.862Z |
| 5th row | 2024-12-02T13:56:47.781Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:35.184z | 1 | |
| 2024-12-02t13:58:31.286z | 1 | |
| 2024-12-02t13:56:38.525z | 1 | |
| 2024-12-02t13:59:43.862z | 1 | |
| 2024-12-02t13:56:47.781z | 1 | |
| 2024-12-02t13:59:40.809z | 1 | |
| 2024-12-02t13:58:46.380z | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 31 | |
| 0 | 17 | |
| 1 | 17 | |
| - | 14 | |
| : | 14 | |
| 4 | 12 | 7.1% |
| 3 | 12 | 7.1% |
| 5 | 10 | 6.0% |
| 8 | 9 | 5.4% |
| T | 7 | 4.2% |
| Other values (5) | 25 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 119 | |
| Other Punctuation | 21 | 12.5% |
| Dash Punctuation | 14 | 8.3% |
| Uppercase Letter | 14 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 31 | |
| 0 | 17 | |
| 1 | 17 | |
| 4 | 12 | 10.1% |
| 3 | 12 | 10.1% |
| 5 | 10 | 8.4% |
| 8 | 9 | 7.6% |
| 6 | 5 | 4.2% |
| 7 | 3 | 2.5% |
| 9 | 3 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 14 | |
| . | 7 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 7 | |
| Z | 7 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 14 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 154 | |
| Latin | 14 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 31 | |
| 0 | 17 | |
| 1 | 17 | |
| - | 14 | |
| : | 14 | |
| 4 | 12 | 7.8% |
| 3 | 12 | 7.8% |
| 5 | 10 | 6.5% |
| 8 | 9 | 5.8% |
| . | 7 | 4.5% |
| Other values (3) | 11 | 7.1% |
Latin
| Value | Count | Frequency (%) |
| T | 7 | |
| Z | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 168 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 31 | |
| 0 | 17 | |
| 1 | 17 | |
| - | 14 | |
| : | 14 | |
| 4 | 12 | 7.1% |
| 3 | 12 | 7.1% |
| 5 | 10 | 6.0% |
| 8 | 9 | 5.4% |
| T | 7 | 4.2% |
| Other values (5) | 25 |
| Distinct | 22054 |
|---|---|
| Distinct (%) | 4.8% |
| Missing | 211 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.847697038 |
| Min length | 2 |
Unique
| Unique | 4768 ? |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | 5213106 |
|---|---|
| 2nd row | 7822511 |
| 3rd row | 5209001 |
| 4th row | 2359811 |
| 5th row | 2369651 |
| Value | Count | Frequency (%) |
| 4274 | 1630 | 0.4% |
| 2360481 | 1121 | 0.2% |
| 2359014 | 1113 | 0.2% |
| 2359823 | 1006 | 0.2% |
| 2376138 | 1001 | 0.2% |
| 2366967 | 904 | 0.2% |
| 2367736 | 893 | 0.2% |
| 2394503 | 857 | 0.2% |
| 2361357 | 853 | 0.2% |
| 2358931 | 760 | 0.2% |
| Other values (22044) | 444863 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 599111 | |
| 3 | 470277 | |
| 4 | 304204 | |
| 5 | 294417 | |
| 8 | 259388 | |
| 0 | 253770 | |
| 9 | 251072 | |
| 1 | 239346 | 7.7% |
| 7 | 223101 | 7.2% |
| 6 | 221023 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3115709 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 599111 | |
| 3 | 470277 | |
| 4 | 304204 | |
| 5 | 294417 | |
| 8 | 259388 | |
| 0 | 253770 | |
| 9 | 251072 | |
| 1 | 239346 | 7.7% |
| 7 | 223101 | 7.2% |
| 6 | 221023 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3115709 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 599111 | |
| 3 | 470277 | |
| 4 | 304204 | |
| 5 | 294417 | |
| 8 | 259388 | |
| 0 | 253770 | |
| 9 | 251072 | |
| 1 | 239346 | 7.7% |
| 7 | 223101 | 7.2% |
| 6 | 221023 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3115709 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 599111 | |
| 3 | 470277 | |
| 4 | 304204 | |
| 5 | 294417 | |
| 8 | 259388 | |
| 0 | 253770 | |
| 9 | 251072 | |
| 1 | 239346 | 7.7% |
| 7 | 223101 | 7.2% |
| 6 | 221023 | 7.1% |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455209 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0.5 |
|---|---|
| 2nd row | 8.5 |
| 3rd row | 2.0 |
| Value | Count | Frequency (%) |
| 0.5 | 1 | |
| 8.5 | 1 | |
| 2.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 3 | |
| 0 | 2 | |
| 5 | 2 | |
| 8 | 1 | 11.1% |
| 2 | 1 | 11.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Other Punctuation | 3 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 5 | 2 | |
| 8 | 1 | |
| 2 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 3 | |
| 0 | 2 | |
| 5 | 2 | |
| 8 | 1 | 11.1% |
| 2 | 1 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 3 | |
| 0 | 2 | |
| 5 | 2 | |
| 8 | 1 | 11.1% |
| 2 | 1 | 11.1% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 66.7% |
| Missing | 455209 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 33.3% |
Sample
| 1st row | 0.5 |
|---|---|
| 2nd row | 0.5 |
| 3rd row | 2.0 |
| Value | Count | Frequency (%) |
| 0.5 | 2 | |
| 2.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3 | |
| . | 3 | |
| 5 | 2 | |
| 2 | 1 | 11.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Other Punctuation | 3 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 5 | 2 | |
| 2 | 1 | 16.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3 | |
| . | 3 | |
| 5 | 2 | |
| 2 | 1 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3 | |
| . | 3 | |
| 5 | 2 | |
| 2 | 1 | 11.1% |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 42.9% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 153 |
|---|---|
| Median length | 48 |
| Mean length | 86.42857143 |
| Min length | 48 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 14.3% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
|---|---|
| 2nd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_DERIVED_FROM_COORDINATES;CONTINENT_INVALID |
| 3rd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 4th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 5th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;GEODETIC_DATUM_INVALID;CONTINENT_DERIVED_FROM_COORDINATES;CONTINENT_INVALID |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count | 4 | |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;continent_invalid | 2 | |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;geodetic_datum_invalid;continent_derived_from_coordinates;continent_invalid | 1 | 14.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 58 | |
| E | 54 | 8.9% |
| N | 53 | 8.8% |
| I | 52 | 8.6% |
| D | 45 | 7.4% |
| T | 44 | 7.3% |
| R | 44 | 7.3% |
| C | 41 | 6.8% |
| O | 40 | 6.6% |
| U | 35 | 5.8% |
| Other values (11) | 139 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 531 | |
| Connector Punctuation | 58 | 9.6% |
| Other Punctuation | 10 | 1.7% |
| Decimal Number | 6 | 1.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 54 | |
| N | 53 | |
| I | 52 | |
| D | 45 | |
| T | 44 | |
| R | 44 | |
| C | 41 | |
| O | 40 | |
| U | 35 | 6.6% |
| A | 28 | 5.3% |
| Other values (7) | 95 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 3 | |
| 4 | 3 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 58 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 531 | |
| Common | 74 | 12.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 54 | |
| N | 53 | |
| I | 52 | |
| D | 45 | |
| T | 44 | |
| R | 44 | |
| C | 41 | |
| O | 40 | |
| U | 35 | 6.6% |
| A | 28 | 5.3% |
| Other values (7) | 95 |
Common
| Value | Count | Frequency (%) |
| _ | 58 | |
| ; | 10 | 13.5% |
| 8 | 3 | 4.1% |
| 4 | 3 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 605 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 58 | |
| E | 54 | 8.9% |
| N | 53 | 8.8% |
| I | 52 | 8.6% |
| D | 45 | 7.4% |
| T | 44 | 7.3% |
| R | 44 | 7.3% |
| C | 41 | 6.8% |
| O | 40 | 6.6% |
| U | 35 | 5.8% |
| Other values (11) | 139 |
taxonConceptID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 455210 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | StillImage |
|---|---|
| 2nd row | StillImage |
| Value | Count | Frequency (%) |
| stillimage | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 4 | |
| S | 2 | |
| t | 2 | |
| i | 2 | |
| I | 2 | |
| m | 2 | |
| a | 2 | |
| g | 2 | |
| e | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16 | |
| Uppercase Letter | 4 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 4 | |
| t | 2 | |
| i | 2 | |
| m | 2 | |
| a | 2 | |
| g | 2 | |
| e | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2 | |
| I | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 4 | |
| S | 2 | |
| t | 2 | |
| i | 2 | |
| I | 2 | |
| m | 2 | |
| a | 2 | |
| g | 2 | |
| e | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 4 | |
| S | 2 | |
| t | 2 | |
| i | 2 | |
| I | 2 | |
| m | 2 | |
| a | 2 | |
| g | 2 | |
| e | 2 |
scientificName
Text
| Distinct | 28366 |
|---|---|
| Distinct (%) | 6.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 111 |
|---|---|
| Median length | 81 |
| Mean length | 34.33414102 |
| Min length | 4 |
Unique
| Unique | 8055 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | Echidna nebulosa (Ahl, 1789) |
|---|---|
| 2nd row | Mugil Linnaeus, 1758 |
| 3rd row | Cryptocentrus filifer (Valenciennes, 1837) |
| 4th row | Rhinichthys cataractae (Valenciennes, 1842) |
| 5th row | Centropomus ensiferus Poey, 1860 |
| Value | Count | Frequency (%) |
| 74427 | 4.0% | |
| linnaeus | 26768 | 1.4% |
| bleeker | 23949 | 1.3% |
| 1758 | 20993 | 1.1% |
| valenciennes | 20020 | 1.1% |
| cuvier | 18941 | 1.0% |
| jordan | 16870 | 0.9% |
| bloch | 15687 | 0.8% |
| lacepède | 13855 | 0.7% |
| 1801 | 13309 | 0.7% |
| Other values (20362) | 1613420 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1403027 | 9.0% | |
| e | 1045274 | 6.7% |
| a | 1034703 | 6.6% |
| i | 926974 | 5.9% |
| s | 920415 | 5.9% |
| n | 772411 | 4.9% |
| r | 768150 | 4.9% |
| o | 766823 | 4.9% |
| u | 641083 | 4.1% |
| l | 587298 | 3.8% |
| Other values (81) | 6763155 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10580971 | |
| Decimal Number | 1707640 | 10.9% |
| Space Separator | 1403027 | 9.0% |
| Uppercase Letter | 966521 | 6.2% |
| Other Punctuation | 504834 | 3.2% |
| Open Punctuation | 231890 | 1.5% |
| Close Punctuation | 231890 | 1.5% |
| Dash Punctuation | 2540 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1045274 | |
| a | 1034703 | |
| i | 926974 | 8.8% |
| s | 920415 | 8.7% |
| n | 772411 | 7.3% |
| r | 768150 | 7.3% |
| o | 766823 | 7.2% |
| u | 641083 | 6.1% |
| l | 587298 | 5.6% |
| t | 578646 | 5.5% |
| Other values (35) | 2539194 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 100059 | |
| S | 98835 | |
| B | 89015 | 9.2% |
| L | 85231 | 8.8% |
| G | 84945 | 8.8% |
| P | 66777 | 6.9% |
| A | 54030 | 5.6% |
| R | 50242 | 5.2% |
| M | 48437 | 5.0% |
| E | 42560 | 4.4% |
| Other values (18) | 246390 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 499218 | |
| 8 | 355270 | |
| 9 | 178960 | 10.5% |
| 7 | 131050 | 7.7% |
| 5 | 111806 | 6.5% |
| 0 | 107029 | 6.3% |
| 2 | 87774 | 5.1% |
| 6 | 85320 | 5.0% |
| 3 | 84836 | 5.0% |
| 4 | 66377 | 3.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 430063 | |
| & | 74427 | 14.7% |
| . | 270 | 0.1% |
| ' | 74 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1403027 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 231890 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 231890 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2540 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11547492 | |
| Common | 4081821 | 26.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1045274 | 9.1% |
| a | 1034703 | 9.0% |
| i | 926974 | 8.0% |
| s | 920415 | 8.0% |
| n | 772411 | 6.7% |
| r | 768150 | 6.7% |
| o | 766823 | 6.6% |
| u | 641083 | 5.6% |
| l | 587298 | 5.1% |
| t | 578646 | 5.0% |
| Other values (63) | 3505715 |
Common
| Value | Count | Frequency (%) |
| 1403027 | ||
| 1 | 499218 | 12.2% |
| , | 430063 | 10.5% |
| 8 | 355270 | 8.7% |
| ( | 231890 | 5.7% |
| ) | 231890 | 5.7% |
| 9 | 178960 | 4.4% |
| 7 | 131050 | 3.2% |
| 5 | 111806 | 2.7% |
| 0 | 107029 | 2.6% |
| Other values (8) | 401618 | 9.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15573317 | |
| None | 55996 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1403027 | 9.0% | |
| e | 1045274 | 6.7% |
| a | 1034703 | 6.6% |
| i | 926974 | 6.0% |
| s | 920415 | 5.9% |
| n | 772411 | 5.0% |
| r | 768150 | 4.9% |
| o | 766823 | 4.9% |
| u | 641083 | 4.1% |
| l | 587298 | 3.8% |
| Other values (60) | 6707159 |
None
| Value | Count | Frequency (%) |
| ü | 24263 | |
| è | 13883 | |
| å | 11605 | |
| ö | 3033 | 5.4% |
| é | 1849 | 3.3% |
| ø | 571 | 1.0% |
| á | 277 | 0.5% |
| ó | 163 | 0.3% |
| ă | 147 | 0.3% |
| ç | 62 | 0.1% |
| Other values (11) | 143 | 0.3% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 7 |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 7 | |
| a | 7 | |
| l | 7 | |
| s | 7 | |
| e | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 35 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 7 | |
| a | 7 | |
| l | 7 | |
| s | 7 | |
| e | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 35 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 7 | |
| a | 7 | |
| l | 7 | |
| s | 7 | |
| e | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 7 | |
| a | 7 | |
| l | 7 | |
| s | 7 | |
| e | 7 |
parentNameUsage
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2341036 |
|---|---|
| 2nd row | 2384468 |
| 3rd row | 2353475 |
| 4th row | 2373066 |
| 5th row | 2414948 |
| Value | Count | Frequency (%) |
| 2341036 | 1 | |
| 2384468 | 1 | |
| 2353475 | 1 | |
| 2373066 | 1 | |
| 2414948 | 1 | |
| 2393782 | 1 | |
| 2335095 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 6 | 4 | 8.2% |
| 8 | 4 | 8.2% |
| 5 | 4 | 8.2% |
| 0 | 3 | 6.1% |
| 7 | 3 | 6.1% |
| 9 | 3 | 6.1% |
| 1 | 2 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 49 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 6 | 4 | 8.2% |
| 8 | 4 | 8.2% |
| 5 | 4 | 8.2% |
| 0 | 3 | 6.1% |
| 7 | 3 | 6.1% |
| 9 | 3 | 6.1% |
| 1 | 2 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 49 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 6 | 4 | 8.2% |
| 8 | 4 | 8.2% |
| 5 | 4 | 8.2% |
| 0 | 3 | 6.1% |
| 7 | 3 | 6.1% |
| 9 | 3 | 6.1% |
| 1 | 2 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 6 | 4 | 8.2% |
| 8 | 4 | 8.2% |
| 5 | 4 | 8.2% |
| 0 | 3 | 6.1% |
| 7 | 3 | 6.1% |
| 9 | 3 | 6.1% |
| 1 | 2 | 4.1% |
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2341036 |
|---|---|
| 2nd row | 2384468 |
| 3rd row | 2353475 |
| 4th row | 2373066 |
| 5th row | 2414948 |
| Value | Count | Frequency (%) |
| 2341036 | 1 | |
| 2384468 | 1 | |
| 2353475 | 1 | |
| 2373066 | 1 | |
| 2414948 | 1 | |
| 2393782 | 1 | |
| 2335095 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 6 | 4 | 8.2% |
| 8 | 4 | 8.2% |
| 5 | 4 | 8.2% |
| 0 | 3 | 6.1% |
| 7 | 3 | 6.1% |
| 9 | 3 | 6.1% |
| 1 | 2 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 49 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 6 | 4 | 8.2% |
| 8 | 4 | 8.2% |
| 5 | 4 | 8.2% |
| 0 | 3 | 6.1% |
| 7 | 3 | 6.1% |
| 9 | 3 | 6.1% |
| 1 | 2 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 49 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 6 | 4 | 8.2% |
| 8 | 4 | 8.2% |
| 5 | 4 | 8.2% |
| 0 | 3 | 6.1% |
| 7 | 3 | 6.1% |
| 9 | 3 | 6.1% |
| 1 | 2 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 6 | 4 | 8.2% |
| 8 | 4 | 8.2% |
| 5 | 4 | 8.2% |
| 0 | 3 | 6.1% |
| 7 | 3 | 6.1% |
| 9 | 3 | 6.1% |
| 1 | 2 | 4.1% |
nameAccordingTo
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 7 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 7 |
namePublishedIn
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 44 |
|---|---|
| 2nd row | 44 |
| 3rd row | 44 |
| 4th row | 44 |
| 5th row | 44 |
| Value | Count | Frequency (%) |
| 44 | 7 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 14 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 14 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 14 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 14 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 14 |
| Distinct | 868 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 231 |
| Missing (%) | 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 164 |
|---|---|
| Median length | 155 |
| Mean length | 131.5133379 |
| Min length | 3 |
Unique
| Unique | 71 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia, Chordata, Vertebrata, Osteichthyes, Actinopterygii, Neopterygii, Elopomorpha, Anguilliformes, Muraenoidei, Muraenidae, Muraeninae |
|---|---|
| 2nd row | Animalia, Chordata, Vertebrata, Osteichthyes, Actinopterygii, Neopterygii, Acanthopterygii, Perciformes, Percoidei, Mugilidae |
| 3rd row | Animalia, Chordata, Vertebrata, Osteichthyes, Actinopterygii, Neopterygii, Acanthopterygii, Perciformes, Gobioidei, Gobiidae, Gobiinae |
| 4th row | Animalia, Chordata, Vertebrata, Osteichthyes, Actinopterygii, Neopterygii, Ostariophysi, Cypriniformes, Cyprinidae |
| 5th row | Animalia, Chordata, Vertebrata, Osteichthyes, Actinopterygii, Neopterygii, Acanthopterygii, Perciformes, Percoidei, Centropomidae |
| Value | Count | Frequency (%) |
| chordata | 454965 | 9.9% |
| animalia | 454921 | 9.9% |
| vertebrata | 454410 | 9.8% |
| osteichthyes | 444515 | 9.6% |
| actinopterygii | 444459 | 9.6% |
| neopterygii | 444025 | 9.6% |
| acanthopterygii | 293090 | 6.4% |
| perciformes | 213808 | 4.6% |
| percoidei | 96925 | 2.1% |
| ostariophysi | 67590 | 1.5% |
| Other values (974) | 1246012 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 6690834 | 11.2% |
| e | 5764651 | 9.6% |
| t | 4862198 | 8.1% |
| a | 4453200 | 7.4% |
| , | 4159739 | 7.0% |
| 4159739 | 7.0% | |
| r | 4156105 | 6.9% |
| o | 3437162 | 5.7% |
| h | 2157318 | 3.6% |
| n | 2101258 | 3.5% |
| Other values (48) | 17893866 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 46901857 | |
| Uppercase Letter | 4614713 | 7.7% |
| Other Punctuation | 4159739 | 7.0% |
| Space Separator | 4159739 | 7.0% |
| Decimal Number | 22 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 6690834 | |
| e | 5764651 | |
| t | 4862198 | |
| a | 4453200 | |
| r | 4156105 | |
| o | 3437162 | 7.3% |
| h | 2157318 | 4.6% |
| n | 2101258 | 4.5% |
| y | 1975345 | 4.2% |
| c | 1930135 | 4.1% |
| Other values (16) | 9373651 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1292509 | |
| C | 704712 | |
| O | 545269 | |
| V | 454484 | 9.8% |
| N | 451678 | 9.8% |
| P | 431310 | 9.3% |
| S | 209834 | 4.5% |
| G | 105660 | 2.3% |
| L | 88680 | 1.9% |
| B | 85568 | 1.9% |
| Other values (13) | 245009 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 6 | |
| 7 | 5 | |
| 8 | 4 | |
| 0 | 3 | |
| 3 | 2 | 9.1% |
| 9 | 1 | 4.5% |
| 1 | 1 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4159739 |
Space Separator
| Value | Count | Frequency (%) |
| 4159739 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 51516570 | |
| Common | 8319500 | 13.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 6690834 | |
| e | 5764651 | |
| t | 4862198 | 9.4% |
| a | 4453200 | 8.6% |
| r | 4156105 | 8.1% |
| o | 3437162 | 6.7% |
| h | 2157318 | 4.2% |
| n | 2101258 | 4.1% |
| y | 1975345 | 3.8% |
| c | 1930135 | 3.7% |
| Other values (39) | 13988364 |
Common
| Value | Count | Frequency (%) |
| , | 4159739 | |
| 4159739 | ||
| 5 | 6 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 4 | < 0.1% |
| 0 | 3 | < 0.1% |
| 3 | 2 | < 0.1% |
| 9 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 59836070 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 6690834 | 11.2% |
| e | 5764651 | 9.6% |
| t | 4862198 | 8.1% |
| a | 4453200 | 7.4% |
| , | 4159739 | 7.0% |
| 4159739 | 7.0% | |
| r | 4156105 | 6.9% |
| o | 3437162 | 5.7% |
| h | 2157318 | 3.6% |
| n | 2101258 | 3.5% |
| Other values (48) | 17893866 |
kingdom
Text
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 8 |
| Mean length | 8.00267348 |
| Min length | 4 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 454998 | |
| incertae | 207 | < 0.1% |
| sedis | 207 | < 0.1% |
| 5153 | 1 | < 0.1% |
| 8535 | 1 | < 0.1% |
| 6880497 | 1 | < 0.1% |
| 8522 | 1 | < 0.1% |
| 4215 | 1 | < 0.1% |
| 4504 | 1 | < 0.1% |
| 5097 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 910410 | |
| a | 910203 | |
| n | 455205 | |
| A | 454998 | |
| m | 454998 | |
| l | 454998 | |
| e | 621 | < 0.1% |
| s | 414 | < 0.1% |
| r | 207 | < 0.1% |
| t | 207 | < 0.1% |
| Other values (13) | 652 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3187677 | |
| Uppercase Letter | 454998 | 12.5% |
| Space Separator | 207 | < 0.1% |
| Decimal Number | 31 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 910410 | |
| a | 910203 | |
| n | 455205 | |
| m | 454998 | |
| l | 454998 | |
| e | 621 | < 0.1% |
| s | 414 | < 0.1% |
| r | 207 | < 0.1% |
| t | 207 | < 0.1% |
| c | 207 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 8 | |
| 8 | 4 | |
| 4 | 4 | |
| 0 | 3 | 9.7% |
| 2 | 3 | 9.7% |
| 1 | 2 | 6.5% |
| 3 | 2 | 6.5% |
| 9 | 2 | 6.5% |
| 7 | 2 | 6.5% |
| 6 | 1 | 3.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 454998 |
Space Separator
| Value | Count | Frequency (%) |
| 207 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3642675 | |
| Common | 238 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 910410 | |
| a | 910203 | |
| n | 455205 | |
| A | 454998 | |
| m | 454998 | |
| l | 454998 | |
| e | 621 | < 0.1% |
| s | 414 | < 0.1% |
| r | 207 | < 0.1% |
| t | 207 | < 0.1% |
| Other values (2) | 414 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 207 | ||
| 5 | 8 | 3.4% |
| 8 | 4 | 1.7% |
| 4 | 4 | 1.7% |
| 0 | 3 | 1.3% |
| 2 | 3 | 1.3% |
| 1 | 2 | 0.8% |
| 3 | 2 | 0.8% |
| 9 | 2 | 0.8% |
| 7 | 2 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3642913 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 910410 | |
| a | 910203 | |
| n | 455205 | |
| A | 454998 | |
| m | 454998 | |
| l | 454998 | |
| e | 621 | < 0.1% |
| s | 414 | < 0.1% |
| r | 207 | < 0.1% |
| t | 207 | < 0.1% |
| Other values (13) | 652 | < 0.1% |
phylum
Text
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 285 |
| Missing (%) | 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 8 |
| Mean length | 8.000015387 |
| Min length | 7 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Chordata |
|---|---|
| 2nd row | Chordata |
| 3rd row | Chordata |
| 4th row | Chordata |
| 5th row | Chordata |
| Value | Count | Frequency (%) |
| chordata | 454913 | |
| arthropoda | 7 | < 0.1% |
| 2341007 | 1 | < 0.1% |
| 2384450 | 1 | < 0.1% |
| 2353451 | 1 | < 0.1% |
| 2373062 | 1 | < 0.1% |
| 2414937 | 1 | < 0.1% |
| 2371535 | 1 | < 0.1% |
| 2335094 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 909833 | |
| o | 454927 | |
| r | 454927 | |
| h | 454920 | |
| d | 454920 | |
| t | 454920 | |
| C | 454913 | |
| 3 | 11 | < 0.1% |
| 2 | 8 | < 0.1% |
| p | 7 | < 0.1% |
| Other values (9) | 37 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3184454 | |
| Uppercase Letter | 454920 | 12.5% |
| Decimal Number | 49 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 5 | 6 | |
| 0 | 5 | |
| 1 | 4 | 8.2% |
| 7 | 4 | 8.2% |
| 9 | 2 | 4.1% |
| 8 | 1 | 2.0% |
| 6 | 1 | 2.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 909833 | |
| o | 454927 | |
| r | 454927 | |
| h | 454920 | |
| d | 454920 | |
| t | 454920 | |
| p | 7 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 454913 | |
| A | 7 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3639374 | |
| Common | 49 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 5 | 6 | |
| 0 | 5 | |
| 1 | 4 | 8.2% |
| 7 | 4 | 8.2% |
| 9 | 2 | 4.1% |
| 8 | 1 | 2.0% |
| 6 | 1 | 2.0% |
Latin
| Value | Count | Frequency (%) |
| a | 909833 | |
| o | 454927 | |
| r | 454927 | |
| h | 454920 | |
| d | 454920 | |
| t | 454920 | |
| C | 454913 | |
| p | 7 | < 0.1% |
| A | 7 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3639423 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 909833 | |
| o | 454927 | |
| r | 454927 | |
| h | 454920 | |
| d | 454920 | |
| t | 454920 | |
| C | 454913 | |
| 3 | 11 | < 0.1% |
| 2 | 8 | < 0.1% |
| p | 7 | < 0.1% |
| Other values (9) | 37 | < 0.1% |
class
Text
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 444746 |
| Missing (%) | 97.7% |
| Memory size | 3.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 13.50496847 |
| Min length | 6 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Elasmobranchii |
|---|---|
| 2nd row | Petromyzonti |
| 3rd row | Petromyzonti |
| 4th row | Elasmobranchii |
| 5th row | Elasmobranchii |
| Value | Count | Frequency (%) |
| elasmobranchii | 8825 | |
| petromyzonti | 565 | 5.4% |
| leptocardii | 514 | 4.9% |
| holocephali | 362 | 3.5% |
| myxini | 150 | 1.4% |
| dipneusti | 28 | 0.3% |
| coelacanthi | 14 | 0.1% |
| arachnida | 7 | 0.1% |
| amphibia | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 19984 | |
| a | 18569 | |
| o | 11207 | |
| r | 9911 | 7.0% |
| c | 9722 | 6.9% |
| n | 9589 | 6.8% |
| l | 9563 | 6.8% |
| m | 9391 | 6.6% |
| h | 9209 | 6.5% |
| s | 8853 | 6.3% |
| Other values (17) | 25345 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 130877 | |
| Uppercase Letter | 10466 | 7.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 19984 | |
| a | 18569 | |
| o | 11207 | |
| r | 9911 | |
| c | 9722 | |
| n | 9589 | |
| l | 9563 | |
| m | 9391 | |
| h | 9209 | |
| s | 8853 | |
| Other values (9) | 14879 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 8825 | |
| P | 565 | 5.4% |
| L | 514 | 4.9% |
| H | 362 | 3.5% |
| M | 150 | 1.4% |
| D | 28 | 0.3% |
| C | 14 | 0.1% |
| A | 8 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 141343 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 19984 | |
| a | 18569 | |
| o | 11207 | |
| r | 9911 | 7.0% |
| c | 9722 | 6.9% |
| n | 9589 | 6.8% |
| l | 9563 | 6.8% |
| m | 9391 | 6.6% |
| h | 9209 | 6.5% |
| s | 8853 | 6.3% |
| Other values (17) | 25345 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 141343 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 19984 | |
| a | 18569 | |
| o | 11207 | |
| r | 9911 | 7.0% |
| c | 9722 | 6.9% |
| n | 9589 | 6.8% |
| l | 9563 | 6.8% |
| m | 9391 | 6.6% |
| h | 9209 | 6.5% |
| s | 8853 | 6.3% |
| Other values (17) | 25345 |
order
Text
| Distinct | 71 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1001 |
| Missing (%) | 0.2% |
| Memory size | 3.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 19 |
| Mean length | 12.46148376 |
| Min length | 7 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Anguilliformes |
|---|---|
| 2nd row | Mugiliformes |
| 3rd row | Perciformes |
| 4th row | Cypriniformes |
| 5th row | Perciformes |
| Value | Count | Frequency (%) |
| perciformes | 212582 | |
| cypriniformes | 33752 | 7.4% |
| scorpaeniformes | 17672 | 3.9% |
| characiformes | 17478 | 3.8% |
| anguilliformes | 17113 | 3.8% |
| siluriformes | 14280 | 3.1% |
| myctophiformes | 13708 | 3.0% |
| pleuronectiformes | 12320 | 2.7% |
| stomiiformes | 12085 | 2.7% |
| tetraodontiformes | 10526 | 2.3% |
| Other values (61) | 92695 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 810127 | |
| e | 762624 | |
| o | 587988 | |
| i | 567492 | |
| m | 475972 | |
| s | 464456 | |
| f | 454197 | |
| c | 292652 | 5.2% |
| P | 226159 | 4.0% |
| n | 146439 | 2.6% |
| Other values (38) | 872037 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5205890 | |
| Uppercase Letter | 454204 | 8.0% |
| Decimal Number | 49 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 810127 | |
| e | 762624 | |
| o | 587988 | |
| i | 567492 | |
| m | 475972 | |
| s | 464456 | |
| f | 454197 | |
| c | 292652 | 5.6% |
| n | 146439 | 2.8% |
| p | 101430 | 1.9% |
| Other values (13) | 542513 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 226159 | |
| C | 72877 | 16.0% |
| S | 56796 | 12.5% |
| A | 29049 | 6.4% |
| B | 17150 | 3.8% |
| M | 16497 | 3.6% |
| T | 10795 | 2.4% |
| G | 9454 | 2.1% |
| O | 7183 | 1.6% |
| L | 4153 | 0.9% |
| Other values (5) | 4091 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 6 | 4 | 8.2% |
| 8 | 4 | 8.2% |
| 5 | 4 | 8.2% |
| 0 | 3 | 6.1% |
| 7 | 3 | 6.1% |
| 9 | 3 | 6.1% |
| 1 | 2 | 4.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5660094 | |
| Common | 49 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 810127 | |
| e | 762624 | |
| o | 587988 | |
| i | 567492 | |
| m | 475972 | |
| s | 464456 | |
| f | 454197 | |
| c | 292652 | 5.2% |
| P | 226159 | 4.0% |
| n | 146439 | 2.6% |
| Other values (28) | 871988 |
Common
| Value | Count | Frequency (%) |
| 3 | 11 | |
| 2 | 8 | |
| 4 | 7 | |
| 6 | 4 | 8.2% |
| 8 | 4 | 8.2% |
| 5 | 4 | 8.2% |
| 0 | 3 | 6.1% |
| 7 | 3 | 6.1% |
| 9 | 3 | 6.1% |
| 1 | 2 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5660143 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 810127 | |
| e | 762624 | |
| o | 587988 | |
| i | 567492 | |
| m | 475972 | |
| s | 464456 | |
| f | 454197 | |
| c | 292652 | 5.2% |
| P | 226159 | 4.0% |
| n | 146439 | 2.6% |
| Other values (38) | 872037 |
superfamily
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 18 |
| Mean length | 18.28571429 |
| Min length | 15 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Noturus nocturnus |
|---|---|
| 2nd row | Thalassoma lunare |
| 3rd row | Brycon falcatus |
| 4th row | Pseudotropheus elongatus |
| 5th row | Halieutaea brevicauda |
| Value | Count | Frequency (%) |
| noturus | 1 | 7.1% |
| nocturnus | 1 | 7.1% |
| thalassoma | 1 | 7.1% |
| lunare | 1 | 7.1% |
| brycon | 1 | 7.1% |
| falcatus | 1 | 7.1% |
| pseudotropheus | 1 | 7.1% |
| elongatus | 1 | 7.1% |
| halieutaea | 1 | 7.1% |
| brevicauda | 1 | 7.1% |
| Other values (4) | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 14 | |
| a | 14 | |
| u | 13 | 10.2% |
| o | 9 | 7.0% |
| r | 9 | 7.0% |
| e | 9 | 7.0% |
| 7 | 5.5% | |
| c | 7 | 5.5% |
| l | 7 | 5.5% |
| t | 6 | 4.7% |
| Other values (17) | 33 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 114 | |
| Space Separator | 7 | 5.5% |
| Uppercase Letter | 7 | 5.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 14 | |
| a | 14 | |
| u | 13 | |
| o | 9 | |
| r | 9 | |
| e | 9 | |
| c | 7 | 6.1% |
| l | 7 | 6.1% |
| t | 6 | 5.3% |
| n | 5 | 4.4% |
| Other values (10) | 21 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 2 | |
| H | 1 | |
| N | 1 | |
| P | 1 | |
| T | 1 | |
| S | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 121 | |
| Common | 7 | 5.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 14 | |
| a | 14 | |
| u | 13 | |
| o | 9 | 7.4% |
| r | 9 | 7.4% |
| e | 9 | 7.4% |
| c | 7 | 5.8% |
| l | 7 | 5.8% |
| t | 6 | 5.0% |
| n | 5 | 4.1% |
| Other values (16) | 28 |
Common
| Value | Count | Frequency (%) |
| 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 128 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 14 | |
| a | 14 | |
| u | 13 | 10.2% |
| o | 9 | 7.0% |
| r | 9 | 7.0% |
| e | 9 | 7.0% |
| 7 | 5.5% | |
| c | 7 | 5.5% |
| l | 7 | 5.5% |
| t | 6 | 4.7% |
| Other values (17) | 33 |
family
Text
| Distinct | 561 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 833 |
| Missing (%) | 0.2% |
| Memory size | 3.5 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 36 |
| Mean length | 10.79361062 |
| Min length | 6 |
Unique
| Unique | 23 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Muraenidae |
|---|---|
| 2nd row | Mugilidae |
| 3rd row | Gobiidae |
| 4th row | Cyprinidae |
| 5th row | Centropomidae |
| Value | Count | Frequency (%) |
| cyprinidae | 27640 | 6.1% |
| gobiidae | 26017 | 5.7% |
| pomacentridae | 16208 | 3.6% |
| labridae | 14638 | 3.2% |
| blenniidae | 14508 | 3.2% |
| myctophidae | 13553 | 3.0% |
| apogonidae | 12381 | 2.7% |
| serranidae | 11376 | 2.5% |
| characidae | 9124 | 2.0% |
| stomiidae | 7881 | 1.7% |
| Other values (575) | 301078 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 707945 | |
| e | 650924 | |
| i | 646771 | |
| d | 489722 | |
| o | 279193 | 5.7% |
| r | 276574 | 5.6% |
| n | 253779 | 5.2% |
| t | 211871 | 4.3% |
| c | 160423 | 3.3% |
| h | 139416 | 2.8% |
| Other values (53) | 1087772 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4449936 | |
| Uppercase Letter | 454388 | 9.3% |
| Decimal Number | 28 | < 0.1% |
| Space Separator | 25 | < 0.1% |
| Other Punctuation | 9 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 707945 | |
| e | 650924 | |
| i | 646771 | |
| d | 489722 | |
| o | 279193 | 6.3% |
| r | 276574 | 6.2% |
| n | 253779 | 5.7% |
| t | 211871 | 4.8% |
| c | 160423 | 3.6% |
| h | 139416 | 3.1% |
| Other values (17) | 633318 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 99617 | |
| S | 65857 | |
| P | 51368 | |
| M | 37729 | 8.3% |
| A | 35150 | 7.7% |
| G | 34853 | 7.7% |
| L | 29900 | 6.6% |
| B | 27207 | 6.0% |
| H | 16335 | 3.6% |
| E | 14399 | 3.2% |
| Other values (13) | 41973 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 9 | |
| 8 | 6 | |
| 4 | 4 | |
| 0 | 2 | 7.1% |
| 5 | 2 | 7.1% |
| 9 | 2 | 7.1% |
| 6 | 2 | 7.1% |
| 7 | 1 | 3.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 7 | |
| & | 2 | 22.2% |
Space Separator
| Value | Count | Frequency (%) |
| 25 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4904324 | |
| Common | 66 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 707945 | |
| e | 650924 | |
| i | 646771 | |
| d | 489722 | |
| o | 279193 | 5.7% |
| r | 276574 | 5.6% |
| n | 253779 | 5.2% |
| t | 211871 | 4.3% |
| c | 160423 | 3.3% |
| h | 139416 | 2.8% |
| Other values (40) | 1087706 |
Common
| Value | Count | Frequency (%) |
| 25 | ||
| 1 | 9 | 13.6% |
| , | 7 | 10.6% |
| 8 | 6 | 9.1% |
| 4 | 4 | 6.1% |
| 0 | 2 | 3.0% |
| ( | 2 | 3.0% |
| ) | 2 | 3.0% |
| 5 | 2 | 3.0% |
| 9 | 2 | 3.0% |
| Other values (3) | 5 | 7.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4904389 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 707945 | |
| e | 650924 | |
| i | 646771 | |
| d | 489722 | |
| o | 279193 | 5.7% |
| r | 276574 | 5.6% |
| n | 253779 | 5.2% |
| t | 211871 | 4.3% |
| c | 160423 | 3.3% |
| h | 139416 | 2.8% |
| Other values (52) | 1087771 |
None
| Value | Count | Frequency (%) |
| ü | 1 |
subfamily
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 18 |
| Mean length | 18.28571429 |
| Min length | 15 |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Noturus nocturnus |
|---|---|
| 2nd row | Thalassoma lunare |
| 3rd row | Brycon falcatus |
| 4th row | Pseudotropheus elongatus |
| 5th row | Halieutaea brevicauda |
| Value | Count | Frequency (%) |
| noturus | 1 | 7.1% |
| nocturnus | 1 | 7.1% |
| thalassoma | 1 | 7.1% |
| lunare | 1 | 7.1% |
| brycon | 1 | 7.1% |
| falcatus | 1 | 7.1% |
| pseudotropheus | 1 | 7.1% |
| elongatus | 1 | 7.1% |
| halieutaea | 1 | 7.1% |
| brevicauda | 1 | 7.1% |
| Other values (4) | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 14 | |
| a | 14 | |
| u | 13 | 10.2% |
| o | 9 | 7.0% |
| r | 9 | 7.0% |
| e | 9 | 7.0% |
| 7 | 5.5% | |
| c | 7 | 5.5% |
| l | 7 | 5.5% |
| t | 6 | 4.7% |
| Other values (17) | 33 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 114 | |
| Space Separator | 7 | 5.5% |
| Uppercase Letter | 7 | 5.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 14 | |
| a | 14 | |
| u | 13 | |
| o | 9 | |
| r | 9 | |
| e | 9 | |
| c | 7 | 6.1% |
| l | 7 | 6.1% |
| t | 6 | 5.3% |
| n | 5 | 4.4% |
| Other values (10) | 21 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 2 | |
| H | 1 | |
| N | 1 | |
| P | 1 | |
| T | 1 | |
| S | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 121 | |
| Common | 7 | 5.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 14 | |
| a | 14 | |
| u | 13 | |
| o | 9 | 7.4% |
| r | 9 | 7.4% |
| e | 9 | 7.4% |
| c | 7 | 5.8% |
| l | 7 | 5.8% |
| t | 6 | 5.0% |
| n | 5 | 4.1% |
| Other values (16) | 28 |
Common
| Value | Count | Frequency (%) |
| 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 128 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 14 | |
| a | 14 | |
| u | 13 | 10.2% |
| o | 9 | 7.0% |
| r | 9 | 7.0% |
| e | 9 | 7.0% |
| 7 | 5.5% | |
| c | 7 | 5.5% |
| l | 7 | 5.5% |
| t | 6 | 4.7% |
| Other values (17) | 33 |
subtribe
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 455205 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | EML |
| 3rd row | EML |
| 4th row | EML |
| 5th row | EML |
| Value | Count | Frequency (%) |
| eml | 7 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 7 | |
| M | 7 | |
| L | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 21 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 7 | |
| M | 7 | |
| L | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 7 | |
| M | 7 | |
| L | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 7 | |
| M | 7 | |
| L | 7 |
genus
Text
Missing 
| Distinct | 4427 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 23586 |
| Missing (%) | 5.2% |
| Memory size | 3.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 19 |
| Mean length | 9.89925074 |
| Min length | 3 |
Unique
| Unique | 411 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Echidna |
|---|---|
| 2nd row | Mugil |
| 3rd row | Myersina |
| 4th row | Rhinichthys |
| 5th row | Centropomus |
| Value | Count | Frequency (%) |
| etheostoma | 5026 | 1.2% |
| gymnothorax | 4350 | 1.0% |
| lepomis | 4347 | 1.0% |
| notropis | 4334 | 1.0% |
| chaetodon | 4239 | 1.0% |
| lutjanus | 3825 | 0.9% |
| halichoeres | 3118 | 0.7% |
| chromis | 3031 | 0.7% |
| acanthurus | 2923 | 0.7% |
| pomacentrus | 2919 | 0.7% |
| Other values (4417) | 393514 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 399528 | 9.4% |
| s | 397208 | 9.3% |
| a | 333959 | 7.8% |
| i | 300262 | 7.0% |
| e | 279207 | 6.5% |
| r | 261041 | 6.1% |
| u | 247831 | 5.8% |
| t | 244352 | 5.7% |
| n | 224564 | 5.3% |
| h | 210029 | 4.9% |
| Other values (55) | 1374793 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3840987 | |
| Uppercase Letter | 431633 | 10.1% |
| Decimal Number | 119 | < 0.1% |
| Other Punctuation | 21 | < 0.1% |
| Dash Punctuation | 14 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 399528 | |
| s | 397208 | |
| a | 333959 | 8.7% |
| i | 300262 | 7.8% |
| e | 279207 | 7.3% |
| r | 261041 | 6.8% |
| u | 247831 | 6.5% |
| t | 244352 | 6.4% |
| n | 224564 | 5.8% |
| h | 210029 | 5.5% |
| Other values (16) | 943006 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 61021 | |
| P | 51248 | |
| S | 48961 | |
| A | 40636 | |
| E | 28869 | 6.7% |
| L | 26991 | 6.3% |
| M | 24971 | 5.8% |
| H | 24933 | 5.8% |
| N | 18765 | 4.3% |
| G | 18448 | 4.3% |
| Other values (16) | 86790 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 31 | |
| 0 | 17 | |
| 1 | 17 | |
| 4 | 12 | 10.1% |
| 3 | 12 | 10.1% |
| 5 | 10 | 8.4% |
| 8 | 9 | 7.6% |
| 6 | 5 | 4.2% |
| 9 | 3 | 2.5% |
| 7 | 3 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 14 | |
| . | 7 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 14 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4272620 | |
| Common | 154 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 399528 | 9.4% |
| s | 397208 | 9.3% |
| a | 333959 | 7.8% |
| i | 300262 | 7.0% |
| e | 279207 | 6.5% |
| r | 261041 | 6.1% |
| u | 247831 | 5.8% |
| t | 244352 | 5.7% |
| n | 224564 | 5.3% |
| h | 210029 | 4.9% |
| Other values (42) | 1374639 |
Common
| Value | Count | Frequency (%) |
| 2 | 31 | |
| 0 | 17 | |
| 1 | 17 | |
| : | 14 | |
| - | 14 | |
| 4 | 12 | 7.8% |
| 3 | 12 | 7.8% |
| 5 | 10 | 6.5% |
| 8 | 9 | 5.8% |
| . | 7 | 4.5% |
| Other values (3) | 11 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4272774 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 399528 | 9.4% |
| s | 397208 | 9.3% |
| a | 333959 | 7.8% |
| i | 300262 | 7.0% |
| e | 279207 | 6.5% |
| r | 261041 | 6.1% |
| u | 247831 | 5.8% |
| t | 244352 | 5.7% |
| n | 224564 | 5.3% |
| h | 210029 | 4.9% |
| Other values (55) | 1374793 |
genericName
Text
Missing 
| Distinct | 5329 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 23579 |
| Missing (%) | 5.2% |
| Memory size | 3.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 19 |
| Mean length | 9.850122674 |
| Min length | 2 |
Unique
| Unique | 744 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Echidna |
|---|---|
| 2nd row | Mugil |
| 3rd row | Cryptocentrus |
| 4th row | Rhinichthys |
| 5th row | Centropomus |
| Value | Count | Frequency (%) |
| notropis | 7148 | 1.7% |
| etheostoma | 4849 | 1.1% |
| gymnothorax | 4324 | 1.0% |
| lepomis | 4276 | 1.0% |
| chaetodon | 4249 | 1.0% |
| lutjanus | 3807 | 0.9% |
| halichoeres | 3126 | 0.7% |
| chromis | 3122 | 0.7% |
| pomacentrus | 2956 | 0.7% |
| acanthurus | 2892 | 0.7% |
| Other values (5319) | 390884 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 401047 | 9.4% |
| s | 398676 | 9.4% |
| a | 333316 | 7.8% |
| i | 299389 | 7.0% |
| e | 276042 | 6.5% |
| r | 259555 | 6.1% |
| t | 246425 | 5.8% |
| u | 244647 | 5.8% |
| n | 220237 | 5.2% |
| h | 207615 | 4.9% |
| Other values (52) | 1364689 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3819844 | |
| Uppercase Letter | 431640 | 10.2% |
| Decimal Number | 119 | < 0.1% |
| Other Punctuation | 21 | < 0.1% |
| Dash Punctuation | 14 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 401047 | |
| s | 398676 | |
| a | 333316 | 8.7% |
| i | 299389 | 7.8% |
| e | 276042 | 7.2% |
| r | 259555 | 6.8% |
| t | 246425 | 6.5% |
| u | 244647 | 6.4% |
| n | 220237 | 5.8% |
| h | 207615 | 5.4% |
| Other values (16) | 932895 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 59157 | |
| P | 51434 | |
| S | 48641 | |
| A | 41597 | |
| E | 28275 | 6.6% |
| L | 26626 | 6.2% |
| H | 25136 | 5.8% |
| M | 25069 | 5.8% |
| N | 20800 | 4.8% |
| G | 18793 | 4.4% |
| Other values (16) | 86112 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 35 | |
| 1 | 28 | |
| 4 | 21 | |
| 0 | 14 | 11.8% |
| 8 | 7 | 5.9% |
| 3 | 7 | 5.9% |
| 6 | 7 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 14 | |
| . | 7 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 14 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4251484 | |
| Common | 154 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 401047 | 9.4% |
| s | 398676 | 9.4% |
| a | 333316 | 7.8% |
| i | 299389 | 7.0% |
| e | 276042 | 6.5% |
| r | 259555 | 6.1% |
| t | 246425 | 5.8% |
| u | 244647 | 5.8% |
| n | 220237 | 5.2% |
| h | 207615 | 4.9% |
| Other values (42) | 1364535 |
Common
| Value | Count | Frequency (%) |
| 2 | 35 | |
| 1 | 28 | |
| 4 | 21 | |
| 0 | 14 | 9.1% |
| - | 14 | 9.1% |
| : | 14 | 9.1% |
| 8 | 7 | 4.5% |
| 3 | 7 | 4.5% |
| . | 7 | 4.5% |
| 6 | 7 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4251638 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 401047 | 9.4% |
| s | 398676 | 9.4% |
| a | 333316 | 7.8% |
| i | 299389 | 7.0% |
| e | 276042 | 6.5% |
| r | 259555 | 6.1% |
| t | 246425 | 5.8% |
| u | 244647 | 5.8% |
| n | 220237 | 5.2% |
| h | 207615 | 4.9% |
| Other values (52) | 1364689 |
subgenus
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 33.3% |
| Missing | 455206 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.166666667 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 16.7% |
Sample
| 1st row | false |
|---|---|
| 2nd row | true |
| 3rd row | true |
| 4th row | true |
| 5th row | true |
| Value | Count | Frequency (%) |
| true | 5 | |
| false | 1 | 16.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6 | |
| t | 5 | |
| r | 5 | |
| u | 5 | |
| f | 1 | 4.0% |
| a | 1 | 4.0% |
| l | 1 | 4.0% |
| s | 1 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 6 | |
| t | 5 | |
| r | 5 | |
| u | 5 | |
| f | 1 | 4.0% |
| a | 1 | 4.0% |
| l | 1 | 4.0% |
| s | 1 | 4.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 6 | |
| t | 5 | |
| r | 5 | |
| u | 5 | |
| f | 1 | 4.0% |
| a | 1 | 4.0% |
| l | 1 | 4.0% |
| s | 1 | 4.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 6 | |
| t | 5 | |
| r | 5 | |
| u | 5 | |
| f | 1 | 4.0% |
| a | 1 | 4.0% |
| l | 1 | 4.0% |
| s | 1 | 4.0% |
specificEpithet
Text
Missing 
| Distinct | 12528 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 70259 |
| Missing (%) | 15.4% |
| Memory size | 3.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 17 |
| Mean length | 8.890235951 |
| Min length | 2 |
Unique
| Unique | 2693 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | nebulosa |
|---|---|
| 2nd row | filifer |
| 3rd row | cataractae |
| 4th row | ensiferus |
| 5th row | inferomaculata |
| Value | Count | Frequency (%) |
| maculatus | 1803 | 0.5% |
| fasciatus | 1624 | 0.4% |
| lineatus | 1573 | 0.4% |
| punctatus | 1558 | 0.4% |
| affinis | 1520 | 0.4% |
| ocellatus | 1448 | 0.4% |
| nigricans | 1438 | 0.4% |
| cornutus | 1264 | 0.3% |
| notatus | 1167 | 0.3% |
| niger | 1160 | 0.3% |
| Other values (12518) | 370398 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 384962 | |
| s | 357791 | |
| i | 357450 | |
| u | 276235 | 8.1% |
| e | 241281 | 7.1% |
| r | 227543 | 6.6% |
| t | 215016 | 6.3% |
| n | 207432 | 6.1% |
| o | 194229 | 5.7% |
| l | 192527 | 5.6% |
| Other values (19) | 767857 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3422237 | |
| Dash Punctuation | 86 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 384962 | |
| s | 357791 | |
| i | 357450 | |
| u | 276235 | 8.1% |
| e | 241281 | 7.1% |
| r | 227543 | 6.6% |
| t | 215016 | 6.3% |
| n | 207432 | 6.1% |
| o | 194229 | 5.7% |
| l | 192527 | 5.6% |
| Other values (18) | 767771 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 86 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3422237 | |
| Common | 86 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 384962 | |
| s | 357791 | |
| i | 357450 | |
| u | 276235 | 8.1% |
| e | 241281 | 7.1% |
| r | 227543 | 6.6% |
| t | 215016 | 6.3% |
| n | 207432 | 6.1% |
| o | 194229 | 5.7% |
| l | 192527 | 5.6% |
| Other values (18) | 767771 |
Common
| Value | Count | Frequency (%) |
| - | 86 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3422320 | |
| None | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 384962 | |
| s | 357791 | |
| i | 357450 | |
| u | 276235 | 8.1% |
| e | 241281 | 7.1% |
| r | 227543 | 6.6% |
| t | 215016 | 6.3% |
| n | 207432 | 6.1% |
| o | 194229 | 5.7% |
| l | 192527 | 5.6% |
| Other values (17) | 767854 |
None
| Value | Count | Frequency (%) |
| ü | 2 | |
| ö | 1 |
Missing 
| Distinct | 681 |
|---|---|
| Distinct (%) | 8.3% |
| Missing | 447018 |
| Missing (%) | 98.2% |
| Memory size | 3.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 16 |
| Mean length | 8.942762997 |
| Min length | 3 |
Unique
| Unique | 208 ? |
|---|---|
| Unique (%) | 2.5% |
Sample
| 1st row | niloticus |
|---|---|
| 2nd row | ramosus |
| 3rd row | vexillare |
| 4th row | vermiculatus |
| 5th row | exilicauda |
| Value | Count | Frequency (%) |
| leptocephalus | 303 | 3.7% |
| atromaculatus | 222 | 2.7% |
| crocodilus | 221 | 2.7% |
| atratulus | 169 | 2.1% |
| vermiculatus | 156 | 1.9% |
| ferox | 145 | 1.8% |
| commersonnii | 138 | 1.7% |
| interocularis | 121 | 1.5% |
| purpurescens | 120 | 1.5% |
| salmoides | 114 | 1.4% |
| Other values (671) | 6485 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 8034 | |
| a | 7736 | |
| i | 6963 | |
| u | 6427 | |
| r | 5139 | 7.0% |
| e | 5070 | 6.9% |
| o | 5038 | 6.9% |
| l | 4896 | 6.7% |
| c | 4349 | 5.9% |
| t | 4159 | 5.7% |
| Other values (17) | 15466 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 73265 | |
| Dash Punctuation | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 8034 | |
| a | 7736 | |
| i | 6963 | |
| u | 6427 | |
| r | 5139 | 7.0% |
| e | 5070 | 6.9% |
| o | 5038 | 6.9% |
| l | 4896 | 6.7% |
| c | 4349 | 5.9% |
| t | 4159 | 5.7% |
| Other values (16) | 15454 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 73265 | |
| Common | 12 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 8034 | |
| a | 7736 | |
| i | 6963 | |
| u | 6427 | |
| r | 5139 | 7.0% |
| e | 5070 | 6.9% |
| o | 5038 | 6.9% |
| l | 4896 | 6.7% |
| c | 4349 | 5.9% |
| t | 4159 | 5.7% |
| Other values (16) | 15454 |
Common
| Value | Count | Frequency (%) |
| - | 12 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 73277 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 8034 | |
| a | 7736 | |
| i | 6963 | |
| u | 6427 | |
| r | 5139 | 7.0% |
| e | 5070 | 6.9% |
| o | 5038 | 6.9% |
| l | 4896 | 6.7% |
| c | 4349 | 5.9% |
| t | 4159 | 5.7% |
| Other values (17) | 15466 |
cultivarEpithet
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 83.3% |
| Missing | 455206 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 8.166666667 |
| Min length | 4 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 66.7% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | AFRICA |
| 3rd row | LATIN_AMERICA |
| 4th row | AFRICA |
| 5th row | ASIA |
| Value | Count | Frequency (%) |
| africa | 2 | |
| north_america | 1 | |
| latin_america | 1 | |
| asia | 1 | |
| oceania | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 13 | |
| I | 7 | |
| R | 5 | 10.2% |
| C | 5 | 10.2% |
| N | 3 | 6.1% |
| E | 3 | 6.1% |
| F | 2 | 4.1% |
| O | 2 | 4.1% |
| T | 2 | 4.1% |
| _ | 2 | 4.1% |
| Other values (4) | 5 | 10.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 47 | |
| Connector Punctuation | 2 | 4.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 13 | |
| I | 7 | |
| R | 5 | 10.6% |
| C | 5 | 10.6% |
| N | 3 | 6.4% |
| E | 3 | 6.4% |
| F | 2 | 4.3% |
| O | 2 | 4.3% |
| T | 2 | 4.3% |
| M | 2 | 4.3% |
| Other values (3) | 3 | 6.4% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 47 | |
| Common | 2 | 4.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 13 | |
| I | 7 | |
| R | 5 | 10.6% |
| C | 5 | 10.6% |
| N | 3 | 6.4% |
| E | 3 | 6.4% |
| F | 2 | 4.3% |
| O | 2 | 4.3% |
| T | 2 | 4.3% |
| M | 2 | 4.3% |
| Other values (3) | 3 | 6.4% |
Common
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 13 | |
| I | 7 | |
| R | 5 | 10.2% |
| C | 5 | 10.2% |
| N | 3 | 6.1% |
| E | 3 | 6.1% |
| F | 2 | 4.1% |
| O | 2 | 4.1% |
| T | 2 | 4.1% |
| _ | 2 | 4.1% |
| Other values (4) | 5 | 10.2% |
taxonRank
Text
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 6.796793582 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SPECIES |
|---|---|
| 2nd row | GENUS |
| 3rd row | SPECIES |
| 4th row | SPECIES |
| 5th row | SPECIES |
| Value | Count | Frequency (%) |
| species | 376767 | |
| genus | 46673 | 10.3% |
| family | 22827 | 5.0% |
| subspecies | 8175 | 1.8% |
| order | 347 | 0.1% |
| kingdom | 204 | < 0.1% |
| phylum | 198 | < 0.1% |
| variety | 12 | < 0.1% |
| north_america | 7 | < 0.1% |
| class | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 824736 | |
| E | 816923 | |
| I | 407992 | |
| P | 385140 | |
| C | 384951 | |
| U | 55046 | 1.8% |
| N | 46884 | 1.5% |
| G | 46877 | 1.5% |
| M | 23236 | 0.8% |
| Y | 23037 | 0.7% |
| Other values (12) | 79160 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3093975 | |
| Connector Punctuation | 7 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 824736 | |
| E | 816923 | |
| I | 407992 | |
| P | 385140 | |
| C | 384951 | |
| U | 55046 | 1.8% |
| N | 46884 | 1.5% |
| G | 46877 | 1.5% |
| M | 23236 | 0.8% |
| Y | 23037 | 0.7% |
| Other values (11) | 79153 | 2.6% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3093975 | |
| Common | 7 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 824736 | |
| E | 816923 | |
| I | 407992 | |
| P | 385140 | |
| C | 384951 | |
| U | 55046 | 1.8% |
| N | 46884 | 1.5% |
| G | 46877 | 1.5% |
| M | 23236 | 0.8% |
| Y | 23037 | 0.7% |
| Other values (11) | 79153 | 2.6% |
Common
| Value | Count | Frequency (%) |
| _ | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3093982 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 824736 | |
| E | 816923 | |
| I | 407992 | |
| P | 385140 | |
| C | 384951 | |
| U | 55046 | 1.8% |
| N | 46884 | 1.5% |
| G | 46877 | 1.5% |
| M | 23236 | 0.8% |
| Y | 23037 | 0.7% |
| Other values (12) | 79160 | 2.6% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455210 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | SYC |
|---|---|
| 2nd row | PHL |
| Value | Count | Frequency (%) |
| syc | 1 | |
| phl | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 1 | |
| Y | 1 | |
| C | 1 | |
| P | 1 | |
| H | 1 | |
| L | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| Y | 1 | |
| C | 1 | |
| P | 1 | |
| H | 1 | |
| L | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 1 | |
| Y | 1 | |
| C | 1 | |
| P | 1 | |
| H | 1 | |
| L | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 1 | |
| Y | 1 | |
| C | 1 | |
| P | 1 | |
| H | 1 | |
| L | 1 |
vernacularName
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455210 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10.5 |
| Mean length | 10.5 |
| Min length | 10 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Seychelles |
|---|---|
| 2nd row | Philippines |
| Value | Count | Frequency (%) |
| seychelles | 1 | |
| philippines | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4 | |
| l | 3 | |
| i | 3 | |
| h | 2 | |
| s | 2 | |
| p | 2 | |
| S | 1 | 4.8% |
| y | 1 | 4.8% |
| c | 1 | 4.8% |
| P | 1 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19 | |
| Uppercase Letter | 2 | 9.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4 | |
| l | 3 | |
| i | 3 | |
| h | 2 | |
| s | 2 | |
| p | 2 | |
| y | 1 | 5.3% |
| c | 1 | 5.3% |
| n | 1 | 5.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4 | |
| l | 3 | |
| i | 3 | |
| h | 2 | |
| s | 2 | |
| p | 2 | |
| S | 1 | 4.8% |
| y | 1 | 4.8% |
| c | 1 | 4.8% |
| P | 1 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 4 | |
| l | 3 | |
| i | 3 | |
| h | 2 | |
| s | 2 | |
| p | 2 | |
| S | 1 | 4.8% |
| y | 1 | 4.8% |
| c | 1 | 4.8% |
| P | 1 | 4.8% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455210 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | SYC.20_1 |
|---|---|
| 2nd row | PHL.36_1 |
| Value | Count | Frequency (%) |
| syc.20_1 | 1 | |
| phl.36_1 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 2 | |
| _ | 2 | |
| 1 | 2 | |
| S | 1 | 6.2% |
| Y | 1 | 6.2% |
| C | 1 | 6.2% |
| 2 | 1 | 6.2% |
| 0 | 1 | 6.2% |
| P | 1 | 6.2% |
| H | 1 | 6.2% |
| Other values (3) | 3 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Uppercase Letter | 6 | |
| Other Punctuation | 2 | 12.5% |
| Connector Punctuation | 2 | 12.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| Y | 1 | |
| C | 1 | |
| P | 1 | |
| H | 1 | |
| L | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 2 | 1 | |
| 0 | 1 | |
| 3 | 1 | |
| 6 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10 | |
| Latin | 6 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 2 | |
| _ | 2 | |
| 1 | 2 | |
| 2 | 1 | |
| 0 | 1 | |
| 3 | 1 | |
| 6 | 1 |
Latin
| Value | Count | Frequency (%) |
| S | 1 | |
| Y | 1 | |
| C | 1 | |
| P | 1 | |
| H | 1 | |
| L | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 2 | |
| _ | 2 | |
| 1 | 2 | |
| S | 1 | 6.2% |
| Y | 1 | 6.2% |
| C | 1 | 6.2% |
| 2 | 1 | 6.2% |
| 0 | 1 | 6.2% |
| P | 1 | 6.2% |
| H | 1 | 6.2% |
| Other values (3) | 3 |
taxonomicStatus
Text
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 209 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 7.910209383 |
| Min length | 6 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | ACCEPTED |
| 3rd row | SYNONYM |
| 4th row | ACCEPTED |
| 5th row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 413893 | |
| synonym | 40858 | 9.0% |
| doubtful | 250 | 0.1% |
| outer | 1 | < 0.1% |
| islands | 1 | < 0.1% |
| iloilo | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 827786 | |
| C | 827786 | |
| T | 414143 | |
| D | 414143 | |
| A | 413893 | |
| P | 413893 | |
| Y | 81716 | 2.3% |
| N | 81716 | 2.3% |
| O | 41109 | 1.1% |
| S | 40858 | 1.1% |
| Other values (18) | 42126 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3599153 | |
| Lowercase Letter | 15 | < 0.1% |
| Space Separator | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 827786 | |
| C | 827786 | |
| T | 414143 | |
| D | 414143 | |
| A | 413893 | |
| P | 413893 | |
| Y | 81716 | 2.3% |
| N | 81716 | 2.3% |
| O | 41109 | 1.1% |
| S | 40858 | 1.1% |
| Other values (6) | 42110 | 1.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 3 | |
| s | 2 | |
| o | 2 | |
| u | 1 | 6.7% |
| t | 1 | 6.7% |
| e | 1 | 6.7% |
| r | 1 | 6.7% |
| a | 1 | 6.7% |
| n | 1 | 6.7% |
| d | 1 | 6.7% |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3599168 | |
| Common | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 827786 | |
| C | 827786 | |
| T | 414143 | |
| D | 414143 | |
| A | 413893 | |
| P | 413893 | |
| Y | 81716 | 2.3% |
| N | 81716 | 2.3% |
| O | 41109 | 1.1% |
| S | 40858 | 1.1% |
| Other values (17) | 42125 | 1.2% |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3599169 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 827786 | |
| C | 827786 | |
| T | 414143 | |
| D | 414143 | |
| A | 413893 | |
| P | 413893 | |
| Y | 81716 | 2.3% |
| N | 81716 | 2.3% |
| O | 41109 | 1.1% |
| S | 40858 | 1.1% |
| Other values (18) | 42126 | 1.2% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455211 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | PHL.36.21_1 |
|---|
| Value | Count | Frequency (%) |
| phl.36.21_1 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 2 | |
| 1 | 2 | |
| P | 1 | |
| H | 1 | |
| L | 1 | |
| 3 | 1 | |
| 6 | 1 | |
| 2 | 1 | |
| _ | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5 | |
| Uppercase Letter | 3 | |
| Other Punctuation | 2 | 18.2% |
| Connector Punctuation | 1 | 9.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 3 | 1 | |
| 6 | 1 | |
| 2 | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| H | 1 | |
| L | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 | |
| Latin | 3 | 27.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 2 | |
| 1 | 2 | |
| 3 | 1 | |
| 6 | 1 | |
| 2 | 1 | |
| _ | 1 |
Latin
| Value | Count | Frequency (%) |
| P | 1 | |
| H | 1 | |
| L | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 2 | |
| 1 | 2 | |
| P | 1 | |
| H | 1 | |
| L | 1 | |
| 3 | 1 | |
| 6 | 1 | |
| 2 | 1 | |
| _ | 1 |
taxonRemarks
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 455211 |
| Missing (%) | > 99.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Iloilo City |
|---|
| Value | Count | Frequency (%) |
| iloilo | 1 | |
| city | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 2 | |
| o | 2 | |
| i | 2 | |
| I | 1 | |
| 1 | ||
| C | 1 | |
| t | 1 | |
| y | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8 | |
| Uppercase Letter | 2 | 18.2% |
| Space Separator | 1 | 9.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 2 | |
| o | 2 | |
| i | 2 | |
| t | 1 | |
| y | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1 | |
| C | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10 | |
| Common | 1 | 9.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 2 | |
| o | 2 | |
| i | 2 | |
| I | 1 | |
| C | 1 | |
| t | 1 | |
| y | 1 |
Common
| Value | Count | Frequency (%) |
| 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 2 | |
| o | 2 | |
| i | 2 | |
| I | 1 | |
| 1 | ||
| C | 1 | |
| t | 1 | |
| y | 1 |
datasetKey
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 35.99995167 |
| Min length | 14 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
|---|---|
| 2nd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 3rd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 4th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 5th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| Value | Count | Frequency (%) |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 455205 | |
| phl.36.21.66_1 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 1820820 | |
| a | 1820820 | |
| - | 1820820 | |
| 2 | 1365616 | |
| 4 | 1365615 | |
| b | 1365615 | |
| 3 | 910411 | 5.6% |
| d | 910410 | 5.6% |
| 9 | 910410 | 5.6% |
| 5 | 910410 | 5.6% |
| Other values (11) | 3186447 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8193697 | |
| Lowercase Letter | 6372870 | |
| Dash Punctuation | 1820820 | 11.1% |
| Other Punctuation | 3 | < 0.1% |
| Uppercase Letter | 3 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1365616 | |
| 4 | 1365615 | |
| 3 | 910411 | |
| 9 | 910410 | |
| 5 | 910410 | |
| 8 | 910410 | |
| 6 | 455208 | 5.6% |
| 1 | 455207 | 5.6% |
| 7 | 455205 | 5.6% |
| 0 | 455205 | 5.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 1820820 | |
| a | 1820820 | |
| b | 1365615 | |
| d | 910410 | |
| e | 455205 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| H | 1 | |
| L | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1820820 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10014521 | |
| Latin | 6372873 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 1820820 | |
| 2 | 1365616 | |
| 4 | 1365615 | |
| 3 | 910411 | |
| 9 | 910410 | |
| 5 | 910410 | |
| 8 | 910410 | |
| 6 | 455208 | 4.5% |
| 1 | 455207 | 4.5% |
| 7 | 455205 | 4.5% |
| Other values (3) | 455209 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| c | 1820820 | |
| a | 1820820 | |
| b | 1365615 | |
| d | 910410 | |
| e | 455205 | 7.1% |
| P | 1 | < 0.1% |
| H | 1 | < 0.1% |
| L | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16387394 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 1820820 | |
| a | 1820820 | |
| - | 1820820 | |
| 2 | 1365616 | |
| 4 | 1365615 | |
| b | 1365615 | |
| 3 | 910411 | 5.6% |
| d | 910410 | 5.6% |
| 9 | 910410 | 5.6% |
| 5 | 910410 | 5.6% |
| Other values (11) | 3186447 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 2 |
| Mean length | 2.000015378 |
| Min length | 2 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 455205 | |
| kahirupan | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 455205 | |
| S | 455205 | |
| a | 2 | < 0.1% |
| K | 1 | < 0.1% |
| h | 1 | < 0.1% |
| i | 1 | < 0.1% |
| r | 1 | < 0.1% |
| u | 1 | < 0.1% |
| p | 1 | < 0.1% |
| n | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 910411 | |
| Lowercase Letter | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| h | 1 | |
| i | 1 | |
| r | 1 | |
| u | 1 | |
| p | 1 | |
| n | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 455205 | |
| S | 455205 | |
| K | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 910419 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 455205 | |
| S | 455205 | |
| a | 2 | < 0.1% |
| K | 1 | < 0.1% |
| h | 1 | < 0.1% |
| i | 1 | < 0.1% |
| r | 1 | < 0.1% |
| u | 1 | < 0.1% |
| p | 1 | < 0.1% |
| n | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 910419 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 455205 | |
| S | 455205 | |
| a | 2 | < 0.1% |
| K | 1 | < 0.1% |
| h | 1 | < 0.1% |
| i | 1 | < 0.1% |
| r | 1 | < 0.1% |
| u | 1 | < 0.1% |
| p | 1 | < 0.1% |
| n | 1 | < 0.1% |
lastInterpreted
Text
| Distinct | 173327 |
|---|---|
| Distinct (%) | 38.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99554933 |
| Min length | 2 |
Unique
| Unique | 50647 ? |
|---|---|
| Unique (%) | 11.1% |
Sample
| 1st row | 2024-12-02T13:56:09.099Z |
|---|---|
| 2nd row | 2024-12-02T13:56:08.596Z |
| 3rd row | 2024-12-02T13:59:51.375Z |
| 4th row | 2024-12-02T13:58:24.571Z |
| 5th row | 2024-12-02T13:56:08.212Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:53.333z | 14 | < 0.1% |
| 2024-12-02t13:57:01.873z | 14 | < 0.1% |
| 2024-12-02t13:57:04.016z | 13 | < 0.1% |
| 2024-12-02t13:57:52.916z | 13 | < 0.1% |
| 2024-12-02t13:57:28.109z | 13 | < 0.1% |
| 2024-12-02t13:57:41.128z | 13 | < 0.1% |
| 2024-12-02t13:58:01.465z | 13 | < 0.1% |
| 2024-12-02t13:57:03.178z | 13 | < 0.1% |
| 2024-12-02t13:57:30.416z | 13 | < 0.1% |
| 2024-12-02t13:57:30.873z | 12 | < 0.1% |
| Other values (173317) | 455081 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2078433 | |
| 0 | 1154714 | |
| 1 | 1148157 | |
| - | 910410 | |
| : | 910410 | |
| 4 | 731701 | 6.7% |
| 5 | 722652 | 6.6% |
| 3 | 721318 | 6.6% |
| T | 455206 | 4.2% |
| Z | 455205 | 4.2% |
| Other values (10) | 1634856 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7737081 | |
| Other Punctuation | 1365147 | 12.5% |
| Uppercase Letter | 910424 | 8.3% |
| Dash Punctuation | 910410 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2078433 | |
| 0 | 1154714 | |
| 1 | 1148157 | |
| 4 | 731701 | 9.5% |
| 5 | 722652 | 9.3% |
| 3 | 721318 | 9.3% |
| 7 | 349813 | 4.5% |
| 9 | 291367 | 3.8% |
| 6 | 275056 | 3.6% |
| 8 | 263870 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 455206 | |
| Z | 455205 | |
| L | 3 | < 0.1% |
| C | 3 | < 0.1% |
| N | 3 | < 0.1% |
| E | 2 | < 0.1% |
| D | 2 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 910410 | |
| . | 454737 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 910410 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10012638 | |
| Latin | 910424 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2078433 | |
| 0 | 1154714 | |
| 1 | 1148157 | |
| - | 910410 | |
| : | 910410 | |
| 4 | 731701 | 7.3% |
| 5 | 722652 | 7.2% |
| 3 | 721318 | 7.2% |
| . | 454737 | 4.5% |
| 7 | 349813 | 3.5% |
| Other values (3) | 830293 | 8.3% |
Latin
| Value | Count | Frequency (%) |
| T | 455206 | |
| Z | 455205 | |
| L | 3 | < 0.1% |
| C | 3 | < 0.1% |
| N | 3 | < 0.1% |
| E | 2 | < 0.1% |
| D | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10923062 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2078433 | |
| 0 | 1154714 | |
| 1 | 1148157 | |
| - | 910410 | |
| : | 910410 | |
| 4 | 731701 | 6.7% |
| 5 | 722652 | 6.6% |
| 3 | 721318 | 6.6% |
| T | 455206 | 4.2% |
| Z | 455205 | 4.2% |
| Other values (10) | 1634856 |
depth
Text
Missing 
| Distinct | 3057 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 246174 |
| Missing (%) | 54.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 18 |
| Mean length | 3.877084549 |
| Min length | 3 |
Unique
| Unique | 590 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 49.5 |
|---|---|
| 2nd row | 2.3 |
| 3rd row | 41.5 |
| 4th row | 7.5 |
| 5th row | 3.5 |
| Value | Count | Frequency (%) |
| 0.5 | 14691 | 7.0% |
| 1.0 | 9522 | 4.6% |
| 1.5 | 7688 | 3.7% |
| 3.0 | 5171 | 2.5% |
| 4.0 | 5073 | 2.4% |
| 2.5 | 5043 | 2.4% |
| 0.0 | 4521 | 2.2% |
| 2.0 | 4468 | 2.1% |
| 3.5 | 4405 | 2.1% |
| 5.0 | 4020 | 1.9% |
| Other values (3047) | 144436 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 209038 | |
| 0 | 188741 | |
| 5 | 117616 | |
| 1 | 74706 | 9.2% |
| 2 | 53659 | 6.6% |
| 3 | 37477 | 4.6% |
| 4 | 33701 | 4.2% |
| 7 | 28173 | 3.5% |
| 6 | 25895 | 3.2% |
| 9 | 21572 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 601420 | |
| Other Punctuation | 209038 | 25.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 188741 | |
| 5 | 117616 | |
| 1 | 74706 | 12.4% |
| 2 | 53659 | 8.9% |
| 3 | 37477 | 6.2% |
| 4 | 33701 | 5.6% |
| 7 | 28173 | 4.7% |
| 6 | 25895 | 4.3% |
| 9 | 21572 | 3.6% |
| 8 | 19880 | 3.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 209038 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 810458 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 209038 | |
| 0 | 188741 | |
| 5 | 117616 | |
| 1 | 74706 | 9.2% |
| 2 | 53659 | 6.6% |
| 3 | 37477 | 4.6% |
| 4 | 33701 | 4.2% |
| 7 | 28173 | 3.5% |
| 6 | 25895 | 3.2% |
| 9 | 21572 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 810458 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 209038 | |
| 0 | 188741 | |
| 5 | 117616 | |
| 1 | 74706 | 9.2% |
| 2 | 53659 | 6.6% |
| 3 | 37477 | 4.6% |
| 4 | 33701 | 4.2% |
| 7 | 28173 | 3.5% |
| 6 | 25895 | 3.2% |
| 9 | 21572 | 2.7% |
depthAccuracy
Text
Missing 
| Distinct | 1206 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 266866 |
| Missing (%) | 58.6% |
| Memory size | 3.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 3 |
| Mean length | 3.532891593 |
| Min length | 3 |
Unique
| Unique | 220 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 10.5 |
|---|---|
| 2nd row | 2.3 |
| 3rd row | 4.5 |
| 4th row | 0.5 |
| 5th row | 1.5 |
| Value | Count | Frequency (%) |
| 0.0 | 39321 | |
| 0.5 | 19670 | 10.4% |
| 1.5 | 14947 | 7.9% |
| 1.0 | 14430 | 7.7% |
| 2.5 | 8360 | 4.4% |
| 2.0 | 7760 | 4.1% |
| 3.0 | 7373 | 3.9% |
| 5.0 | 4367 | 2.3% |
| 3.5 | 3758 | 2.0% |
| 0.25 | 3396 | 1.8% |
| Other values (1196) | 64964 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 209096 | |
| . | 188346 | |
| 5 | 96270 | |
| 1 | 52038 | 7.8% |
| 2 | 32769 | 4.9% |
| 9 | 24929 | 3.7% |
| 3 | 20005 | 3.0% |
| 4 | 15882 | 2.4% |
| 7 | 11418 | 1.7% |
| 6 | 8839 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 477060 | |
| Other Punctuation | 188346 | 28.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 209096 | |
| 5 | 96270 | |
| 1 | 52038 | 10.9% |
| 2 | 32769 | 6.9% |
| 9 | 24929 | 5.2% |
| 3 | 20005 | 4.2% |
| 4 | 15882 | 3.3% |
| 7 | 11418 | 2.4% |
| 6 | 8839 | 1.9% |
| 8 | 5814 | 1.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 188346 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 665406 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 209096 | |
| . | 188346 | |
| 5 | 96270 | |
| 1 | 52038 | 7.8% |
| 2 | 32769 | 4.9% |
| 9 | 24929 | 3.7% |
| 3 | 20005 | 3.0% |
| 4 | 15882 | 2.4% |
| 7 | 11418 | 1.7% |
| 6 | 8839 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 665406 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 209096 | |
| . | 188346 | |
| 5 | 96270 | |
| 1 | 52038 | 7.8% |
| 2 | 32769 | 4.9% |
| 9 | 24929 | 3.7% |
| 3 | 20005 | 3.0% |
| 4 | 15882 | 2.4% |
| 7 | 11418 | 1.7% |
| 6 | 8839 | 1.3% |
distanceFromCentroidInMeters
Text
Missing 
| Distinct | 43 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 454306 |
| Missing (%) | 99.8% |
| Memory size | 3.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 17.29028698 |
| Min length | 16 |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | 3435.2993691323722 |
|---|---|
| 2nd row | 1914.9010623948639 |
| 3rd row | 3286.3383926848273 |
| 4th row | 4049.579332802943 |
| 5th row | 3435.2993691323722 |
| Value | Count | Frequency (%) |
| 3997.886559051776 | 149 | |
| 1914.9010623948639 | 85 | 9.4% |
| 4049.579332802943 | 75 | 8.3% |
| 3435.2993691323722 | 74 | 8.2% |
| 4315.889420844057 | 72 | 7.9% |
| 3469.315853887778 | 51 | 5.6% |
| 3286.3383926848273 | 50 | 5.5% |
| 3413.2475218601576 | 44 | 4.9% |
| 3868.839758506256 | 35 | 3.9% |
| 4088.010727125954 | 28 | 3.1% |
| Other values (33) | 243 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 1953 | |
| 9 | 1907 | |
| 8 | 1651 | |
| 5 | 1468 | |
| 4 | 1453 | |
| 2 | 1411 | |
| 7 | 1405 | |
| 6 | 1213 | |
| 0 | 1158 | |
| 1 | 1140 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 14759 | |
| Other Punctuation | 906 | 5.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1953 | |
| 9 | 1907 | |
| 8 | 1651 | |
| 5 | 1468 | |
| 4 | 1453 | |
| 2 | 1411 | |
| 7 | 1405 | |
| 6 | 1213 | |
| 0 | 1158 | |
| 1 | 1140 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 906 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15665 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 1953 | |
| 9 | 1907 | |
| 8 | 1651 | |
| 5 | 1468 | |
| 4 | 1453 | |
| 2 | 1411 | |
| 7 | 1405 | |
| 6 | 1213 | |
| 0 | 1158 | |
| 1 | 1140 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15665 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 1953 | |
| 9 | 1907 | |
| 8 | 1651 | |
| 5 | 1468 | |
| 4 | 1453 | |
| 2 | 1411 | |
| 7 | 1405 | |
| 6 | 1213 | |
| 0 | 1158 | |
| 1 | 1140 |
issue
Text
| Distinct | 224 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 15 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 213 |
|---|---|
| Median length | 208 |
| Mean length | 86.91829032 |
| Min length | 46 |
Unique
| Unique | 58 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
|---|---|
| 2nd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 3rd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 4th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 5th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count | 145701 | |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country;continent_invalid | 90868 | |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_invalid | 74492 | |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;continent_invalid | 73665 | |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84 | 23814 | 5.2% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;geodetic_datum_invalid;continent_derived_from_coordinates;continent_invalid | 5912 | 1.3% |
| occurrence_status_inferred_from_individual_count;country_derived_from_coordinates;geodetic_datum_assumed_wgs84;continent_invalid | 4969 | 1.1% |
| occurrence_status_inferred_from_individual_count;country_coordinate_mismatch;geodetic_datum_assumed_wgs84;continent_invalid | 4544 | 1.0% |
| occurrence_status_inferred_from_individual_count;taxon_match_higherrank | 3432 | 0.8% |
| occurrence_status_inferred_from_individual_count;taxon_match_fuzzy | 2582 | 0.6% |
| Other values (214) | 25218 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 3794873 | |
| N | 3679859 | |
| E | 3402744 | 8.6% |
| I | 3339442 | 8.4% |
| T | 2939907 | 7.4% |
| R | 2888137 | 7.3% |
| D | 2760112 | 7.0% |
| C | 2717539 | 6.9% |
| O | 2539725 | 6.4% |
| U | 2340457 | 5.9% |
| Other values (18) | 9162150 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 34671685 | |
| Connector Punctuation | 3794873 | 9.6% |
| Other Punctuation | 696481 | 1.8% |
| Decimal Number | 401906 | 1.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 3679859 | |
| E | 3402744 | |
| I | 3339442 | |
| T | 2939907 | |
| R | 2888137 | |
| D | 2760112 | |
| C | 2717539 | |
| O | 2539725 | |
| U | 2340457 | 6.8% |
| A | 1753128 | 5.1% |
| Other values (14) | 6310635 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 200953 | |
| 4 | 200953 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3794873 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 696481 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34671685 | |
| Common | 4893260 | 12.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 3679859 | |
| E | 3402744 | |
| I | 3339442 | |
| T | 2939907 | |
| R | 2888137 | |
| D | 2760112 | |
| C | 2717539 | |
| O | 2539725 | |
| U | 2340457 | 6.8% |
| A | 1753128 | 5.1% |
| Other values (14) | 6310635 |
Common
| Value | Count | Frequency (%) |
| _ | 3794873 | |
| ; | 696481 | 14.2% |
| 8 | 200953 | 4.1% |
| 4 | 200953 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39564945 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 3794873 | |
| N | 3679859 | |
| E | 3402744 | 8.6% |
| I | 3339442 | 8.4% |
| T | 2939907 | 7.4% |
| R | 2888137 | 7.3% |
| D | 2760112 | 7.0% |
| C | 2717539 | 6.9% |
| O | 2539725 | 6.4% |
| U | 2340457 | 5.9% |
| Other values (18) | 9162150 |
mediaType
Text
Missing 
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 363819 |
| Missing (%) | 79.9% |
| Memory size | 3.5 MiB |
Length
| Max length | 659 |
|---|---|
| Median length | 10 |
| Mean length | 17.04920508 |
| Min length | 10 |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | StillImage |
|---|---|
| 2nd row | StillImage |
| 3rd row | StillImage |
| 4th row | StillImage |
| 5th row | StillImage |
| Value | Count | Frequency (%) |
| stillimage | 60095 | |
| stillimage;stillimage | 16175 | 17.7% |
| stillimage;stillimage;stillimage | 9136 | 10.0% |
| stillimage;stillimage;stillimage;stillimage | 3567 | 3.9% |
| stillimage;stillimage;stillimage;stillimage;stillimage | 1344 | 1.5% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 427 | 0.5% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 208 | 0.2% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 113 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 88 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 65 | 0.1% |
| Other values (24) | 175 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 299922 | |
| S | 149961 | |
| t | 149961 | |
| i | 149961 | |
| I | 149961 | |
| m | 149961 | |
| a | 149961 | |
| g | 149961 | |
| e | 149961 | |
| ; | 58568 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1199688 | |
| Uppercase Letter | 299922 | 19.2% |
| Other Punctuation | 58568 | 3.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 299922 | |
| t | 149961 | |
| i | 149961 | |
| m | 149961 | |
| a | 149961 | |
| g | 149961 | |
| e | 149961 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 149961 | |
| I | 149961 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 58568 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1499610 | |
| Common | 58568 | 3.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 299922 | |
| S | 149961 | |
| t | 149961 | |
| i | 149961 | |
| I | 149961 | |
| m | 149961 | |
| a | 149961 | |
| g | 149961 | |
| e | 149961 |
Common
| Value | Count | Frequency (%) |
| ; | 58568 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1558178 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 299922 | |
| S | 149961 | |
| t | 149961 | |
| i | 149961 | |
| I | 149961 | |
| m | 149961 | |
| a | 149961 | |
| g | 149961 | |
| e | 149961 | |
| ; | 58568 | 3.8% |
hasCoordinate
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.558539559 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 254250 | |
| true | 200955 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 455205 | |
| f | 254250 | |
| a | 254250 | |
| l | 254250 | |
| s | 254250 | |
| t | 200955 | |
| r | 200955 | |
| u | 200955 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2075070 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 455205 | |
| f | 254250 | |
| a | 254250 | |
| l | 254250 | |
| s | 254250 | |
| t | 200955 | |
| r | 200955 | |
| u | 200955 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2075070 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 455205 | |
| f | 254250 | |
| a | 254250 | |
| l | 254250 | |
| s | 254250 | |
| t | 200955 | |
| r | 200955 | |
| u | 200955 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2075070 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 455205 | |
| f | 254250 | |
| a | 254250 | |
| l | 254250 | |
| s | 254250 | |
| t | 200955 | |
| r | 200955 | |
| u | 200955 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.986522556 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 449070 | |
| true | 6135 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 455205 | |
| f | 449070 | |
| a | 449070 | |
| l | 449070 | |
| s | 449070 | |
| t | 6135 | 0.3% |
| r | 6135 | 0.3% |
| u | 6135 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2269890 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 455205 | |
| f | 449070 | |
| a | 449070 | |
| l | 449070 | |
| s | 449070 | |
| t | 6135 | 0.3% |
| r | 6135 | 0.3% |
| u | 6135 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2269890 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 455205 | |
| f | 449070 | |
| a | 449070 | |
| l | 449070 | |
| s | 449070 | |
| t | 6135 | 0.3% |
| r | 6135 | 0.3% |
| u | 6135 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2269890 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 455205 | |
| f | 449070 | |
| a | 449070 | |
| l | 449070 | |
| s | 449070 | |
| t | 6135 | 0.3% |
| r | 6135 | 0.3% |
| u | 6135 | 0.3% |
taxonKey
Text
| Distinct | 28364 |
|---|---|
| Distinct (%) | 6.2% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.851581156 |
| Min length | 1 |
Unique
| Unique | 8055 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | 5213106 |
|---|---|
| 2nd row | 7822511 |
| 3rd row | 5209002 |
| 4th row | 2359811 |
| 5th row | 2369651 |
| Value | Count | Frequency (%) |
| 4274 | 1630 | 0.4% |
| 2376138 | 1001 | 0.2% |
| 2359014 | 971 | 0.2% |
| 2360481 | 895 | 0.2% |
| 2367736 | 889 | 0.2% |
| 2361357 | 853 | 0.2% |
| 2359823 | 815 | 0.2% |
| 2358931 | 758 | 0.2% |
| 2365441 | 757 | 0.2% |
| 4253 | 730 | 0.2% |
| Other values (28354) | 445906 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 594777 | |
| 3 | 469007 | |
| 4 | 302909 | |
| 5 | 292779 | |
| 8 | 260314 | |
| 0 | 254319 | |
| 9 | 250457 | |
| 1 | 243983 | |
| 6 | 227063 | 7.3% |
| 7 | 223266 | 7.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3118874 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 594777 | |
| 3 | 469007 | |
| 4 | 302909 | |
| 5 | 292779 | |
| 8 | 260314 | |
| 0 | 254319 | |
| 9 | 250457 | |
| 1 | 243983 | |
| 6 | 227063 | 7.3% |
| 7 | 223266 | 7.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3118874 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 594777 | |
| 3 | 469007 | |
| 4 | 302909 | |
| 5 | 292779 | |
| 8 | 260314 | |
| 0 | 254319 | |
| 9 | 250457 | |
| 1 | 243983 | |
| 6 | 227063 | 7.3% |
| 7 | 223266 | 7.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3118874 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 594777 | |
| 3 | 469007 | |
| 4 | 302909 | |
| 5 | 292779 | |
| 8 | 260314 | |
| 0 | 254319 | |
| 9 | 250457 | |
| 1 | 243983 | |
| 6 | 227063 | 7.3% |
| 7 | 223266 | 7.2% |
acceptedTaxonKey
Text
| Distinct | 22054 |
|---|---|
| Distinct (%) | 4.8% |
| Missing | 211 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.847697038 |
| Min length | 2 |
Unique
| Unique | 4768 ? |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | 5213106 |
|---|---|
| 2nd row | 7822511 |
| 3rd row | 5209001 |
| 4th row | 2359811 |
| 5th row | 2369651 |
| Value | Count | Frequency (%) |
| 4274 | 1630 | 0.4% |
| 2360481 | 1121 | 0.2% |
| 2359014 | 1113 | 0.2% |
| 2359823 | 1006 | 0.2% |
| 2376138 | 1001 | 0.2% |
| 2366967 | 904 | 0.2% |
| 2367736 | 893 | 0.2% |
| 2394503 | 857 | 0.2% |
| 2361357 | 853 | 0.2% |
| 2358931 | 760 | 0.2% |
| Other values (22044) | 444863 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 599111 | |
| 3 | 470277 | |
| 4 | 304204 | |
| 5 | 294417 | |
| 8 | 259388 | |
| 0 | 253770 | |
| 9 | 251072 | |
| 1 | 239346 | 7.7% |
| 7 | 223101 | 7.2% |
| 6 | 221023 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3115709 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 599111 | |
| 3 | 470277 | |
| 4 | 304204 | |
| 5 | 294417 | |
| 8 | 259388 | |
| 0 | 253770 | |
| 9 | 251072 | |
| 1 | 239346 | 7.7% |
| 7 | 223101 | 7.2% |
| 6 | 221023 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3115709 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 599111 | |
| 3 | 470277 | |
| 4 | 304204 | |
| 5 | 294417 | |
| 8 | 259388 | |
| 0 | 253770 | |
| 9 | 251072 | |
| 1 | 239346 | 7.7% |
| 7 | 223101 | 7.2% |
| 6 | 221023 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3115709 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 599111 | |
| 3 | 470277 | |
| 4 | 304204 | |
| 5 | 294417 | |
| 8 | 259388 | |
| 0 | 253770 | |
| 9 | 251072 | |
| 1 | 239346 | 7.7% |
| 7 | 223101 | 7.2% |
| 6 | 221023 | 7.1% |
kingdomKey
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 454998 | |
| 0 | 207 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 454998 | |
| 0 | 207 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 455205 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 454998 | |
| 0 | 207 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 455205 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 454998 | |
| 0 | 207 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 455205 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 454998 | |
| 0 | 207 | < 0.1% |
phylumKey
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 292 |
| Missing (%) | 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 44 |
|---|---|
| 2nd row | 44 |
| 3rd row | 44 |
| 4th row | 44 |
| 5th row | 44 |
| Value | Count | Frequency (%) |
| 44 | 454913 | |
| 54 | 7 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 909833 | |
| 5 | 7 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 909840 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 909833 | |
| 5 | 7 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 909840 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 909833 | |
| 5 | 7 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 909840 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 909833 | |
| 5 | 7 | < 0.1% |
classKey
Text
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 444746 |
| Missing (%) | 97.7% |
| Memory size | 3.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.486432257 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 121 |
|---|---|
| 2nd row | 11881065 |
| 3rd row | 11881065 |
| 4th row | 121 |
| 5th row | 121 |
| Value | Count | Frequency (%) |
| 121 | 8825 | |
| 11881065 | 565 | 5.4% |
| 7375758 | 514 | 4.9% |
| 120 | 362 | 3.5% |
| 119 | 150 | 1.4% |
| 11500725 | 28 | 0.3% |
| 11733052 | 14 | 0.1% |
| 367 | 7 | 0.1% |
| 131 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 20093 | |
| 2 | 9229 | |
| 5 | 1663 | 4.6% |
| 8 | 1644 | 4.5% |
| 7 | 1591 | 4.4% |
| 0 | 997 | 2.7% |
| 6 | 572 | 1.6% |
| 3 | 550 | 1.5% |
| 9 | 150 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 36489 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 20093 | |
| 2 | 9229 | |
| 5 | 1663 | 4.6% |
| 8 | 1644 | 4.5% |
| 7 | 1591 | 4.4% |
| 0 | 997 | 2.7% |
| 6 | 572 | 1.6% |
| 3 | 550 | 1.5% |
| 9 | 150 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 36489 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 20093 | |
| 2 | 9229 | |
| 5 | 1663 | 4.6% |
| 8 | 1644 | 4.5% |
| 7 | 1591 | 4.4% |
| 0 | 997 | 2.7% |
| 6 | 572 | 1.6% |
| 3 | 550 | 1.5% |
| 9 | 150 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36489 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 20093 | |
| 2 | 9229 | |
| 5 | 1663 | 4.6% |
| 8 | 1644 | 4.5% |
| 7 | 1591 | 4.4% |
| 0 | 997 | 2.7% |
| 6 | 572 | 1.6% |
| 3 | 550 | 1.5% |
| 9 | 150 | 0.4% |
orderKey
Text
| Distinct | 64 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1008 |
| Missing (%) | 0.2% |
| Memory size | 3.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 3.16821076 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 495 |
|---|---|
| 2nd row | 1067 |
| 3rd row | 587 |
| 4th row | 1153 |
| 5th row | 587 |
| Value | Count | Frequency (%) |
| 587 | 212582 | |
| 1153 | 33752 | 7.4% |
| 590 | 17672 | 3.9% |
| 537 | 17478 | 3.8% |
| 495 | 17113 | 3.8% |
| 708 | 14280 | 3.1% |
| 1306 | 13708 | 3.0% |
| 588 | 12320 | 2.7% |
| 774 | 12085 | 2.7% |
| 772 | 10526 | 2.3% |
| Other values (54) | 92688 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 341848 | |
| 7 | 323015 | |
| 8 | 297305 | |
| 1 | 117756 | 8.2% |
| 3 | 98457 | 6.8% |
| 9 | 78657 | 5.5% |
| 4 | 74471 | 5.2% |
| 0 | 65102 | 4.5% |
| 6 | 28721 | 2.0% |
| 2 | 13682 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1439014 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 341848 | |
| 7 | 323015 | |
| 8 | 297305 | |
| 1 | 117756 | 8.2% |
| 3 | 98457 | 6.8% |
| 9 | 78657 | 5.5% |
| 4 | 74471 | 5.2% |
| 0 | 65102 | 4.5% |
| 6 | 28721 | 2.0% |
| 2 | 13682 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1439014 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 341848 | |
| 7 | 323015 | |
| 8 | 297305 | |
| 1 | 117756 | 8.2% |
| 3 | 98457 | 6.8% |
| 9 | 78657 | 5.5% |
| 4 | 74471 | 5.2% |
| 0 | 65102 | 4.5% |
| 6 | 28721 | 2.0% |
| 2 | 13682 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1439014 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 341848 | |
| 7 | 323015 | |
| 8 | 297305 | |
| 1 | 117756 | 8.2% |
| 3 | 98457 | 6.8% |
| 9 | 78657 | 5.5% |
| 4 | 74471 | 5.2% |
| 0 | 65102 | 4.5% |
| 6 | 28721 | 2.0% |
| 2 | 13682 | 1.0% |
familyKey
Text
| Distinct | 554 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 840 |
| Missing (%) | 0.2% |
| Memory size | 3.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.022034808 |
| Min length | 4 |
Unique
| Unique | 16 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2953 |
|---|---|
| 2nd row | 8473 |
| 3rd row | 4274 |
| 4th row | 7336 |
| 5th row | 4256 |
| Value | Count | Frequency (%) |
| 7336 | 27640 | 6.1% |
| 4274 | 26017 | 5.7% |
| 4499 | 16208 | 3.6% |
| 8535 | 14638 | 3.2% |
| 4251 | 14508 | 3.2% |
| 4217 | 13553 | 3.0% |
| 4236 | 12381 | 2.7% |
| 8597 | 11376 | 2.5% |
| 7201 | 9124 | 2.0% |
| 2225 | 7881 | 1.7% |
| Other values (544) | 301046 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 265025 | |
| 4 | 244389 | |
| 5 | 233139 | |
| 7 | 194357 | |
| 8 | 187507 | |
| 3 | 177749 | |
| 1 | 146364 | |
| 6 | 145909 | |
| 9 | 143794 | |
| 0 | 89267 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1827500 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 265025 | |
| 4 | 244389 | |
| 5 | 233139 | |
| 7 | 194357 | |
| 8 | 187507 | |
| 3 | 177749 | |
| 1 | 146364 | |
| 6 | 145909 | |
| 9 | 143794 | |
| 0 | 89267 | 4.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1827500 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 265025 | |
| 4 | 244389 | |
| 5 | 233139 | |
| 7 | 194357 | |
| 8 | 187507 | |
| 3 | 177749 | |
| 1 | 146364 | |
| 6 | 145909 | |
| 9 | 143794 | |
| 0 | 89267 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1827500 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 265025 | |
| 4 | 244389 | |
| 5 | 233139 | |
| 7 | 194357 | |
| 8 | 187507 | |
| 3 | 177749 | |
| 1 | 146364 | |
| 6 | 145909 | |
| 9 | 143794 | |
| 0 | 89267 | 4.9% |
genusKey
Text
Missing 
| Distinct | 4426 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 23593 |
| Missing (%) | 5.2% |
| Memory size | 3.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.003530892 |
| Min length | 7 |
Unique
| Unique | 409 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 2404224 |
|---|---|
| 2nd row | 7822511 |
| 3rd row | 2378400 |
| 4th row | 2359788 |
| 5th row | 2356959 |
| Value | Count | Frequency (%) |
| 2382199 | 5026 | 1.2% |
| 2403463 | 4350 | 1.0% |
| 2394482 | 4347 | 1.0% |
| 2362128 | 4334 | 1.0% |
| 2369550 | 4239 | 1.0% |
| 2356953 | 3825 | 0.9% |
| 2381823 | 3118 | 0.7% |
| 5962165 | 3031 | 0.7% |
| 2379647 | 2923 | 0.7% |
| 2380069 | 2919 | 0.7% |
| Other values (4416) | 393507 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 592252 | |
| 3 | 524642 | |
| 4 | 287118 | |
| 9 | 256376 | |
| 6 | 254775 | |
| 5 | 239254 | |
| 8 | 231988 | 7.7% |
| 0 | 221607 | 7.3% |
| 7 | 209793 | 6.9% |
| 1 | 205052 | 6.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3022857 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 592252 | |
| 3 | 524642 | |
| 4 | 287118 | |
| 9 | 256376 | |
| 6 | 254775 | |
| 5 | 239254 | |
| 8 | 231988 | 7.7% |
| 0 | 221607 | 7.3% |
| 7 | 209793 | 6.9% |
| 1 | 205052 | 6.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3022857 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 592252 | |
| 3 | 524642 | |
| 4 | 287118 | |
| 9 | 256376 | |
| 6 | 254775 | |
| 5 | 239254 | |
| 8 | 231988 | 7.7% |
| 0 | 221607 | 7.3% |
| 7 | 209793 | 6.9% |
| 1 | 205052 | 6.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3022857 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 592252 | |
| 3 | 524642 | |
| 4 | 287118 | |
| 9 | 256376 | |
| 6 | 254775 | |
| 5 | 239254 | |
| 8 | 231988 | 7.7% |
| 0 | 221607 | 7.3% |
| 7 | 209793 | 6.9% |
| 1 | 205052 | 6.8% |
speciesKey
Text
Missing 
| Distinct | 19431 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 70260 |
| Missing (%) | 15.4% |
| Memory size | 3.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.002387311 |
| Min length | 7 |
Unique
| Unique | 4217 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | 5213106 |
|---|---|
| 2nd row | 5209001 |
| 3rd row | 2359811 |
| 4th row | 2369651 |
| 5th row | 2403057 |
| Value | Count | Frequency (%) |
| 2360481 | 1121 | 0.3% |
| 2359014 | 1113 | 0.3% |
| 2359823 | 1006 | 0.3% |
| 2361357 | 943 | 0.2% |
| 2365439 | 938 | 0.2% |
| 2366967 | 904 | 0.2% |
| 2367736 | 893 | 0.2% |
| 2394503 | 857 | 0.2% |
| 2365441 | 760 | 0.2% |
| 2358931 | 760 | 0.2% |
| Other values (19421) | 375657 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 523356 | |
| 3 | 406179 | |
| 4 | 259302 | |
| 5 | 256480 | |
| 0 | 226170 | |
| 8 | 225603 | |
| 9 | 216295 | |
| 1 | 207921 | 7.7% |
| 6 | 188342 | 7.0% |
| 7 | 185935 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2695583 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 523356 | |
| 3 | 406179 | |
| 4 | 259302 | |
| 5 | 256480 | |
| 0 | 226170 | |
| 8 | 225603 | |
| 9 | 216295 | |
| 1 | 207921 | 7.7% |
| 6 | 188342 | 7.0% |
| 7 | 185935 | 6.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2695583 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 523356 | |
| 3 | 406179 | |
| 4 | 259302 | |
| 5 | 256480 | |
| 0 | 226170 | |
| 8 | 225603 | |
| 9 | 216295 | |
| 1 | 207921 | 7.7% |
| 6 | 188342 | 7.0% |
| 7 | 185935 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2695583 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 523356 | |
| 3 | 406179 | |
| 4 | 259302 | |
| 5 | 256480 | |
| 0 | 226170 | |
| 8 | 225603 | |
| 9 | 216295 | |
| 1 | 207921 | 7.7% |
| 6 | 188342 | 7.0% |
| 7 | 185935 | 6.9% |
species
Text
Missing 
| Distinct | 19429 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 70260 |
| Missing (%) | 15.4% |
| Memory size | 3.5 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 31 |
| Mean length | 19.79614082 |
| Min length | 8 |
Unique
| Unique | 4217 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | Echidna nebulosa |
|---|---|
| 2nd row | Myersina filifer |
| 3rd row | Rhinichthys cataractae |
| 4th row | Centropomus ensiferus |
| 5th row | Gorgasia inferomaculata |
| Value | Count | Frequency (%) |
| etheostoma | 4924 | 0.6% |
| chaetodon | 4170 | 0.5% |
| notropis | 4110 | 0.5% |
| lepomis | 4038 | 0.5% |
| gymnothorax | 4025 | 0.5% |
| lutjanus | 3782 | 0.5% |
| chromis | 2870 | 0.4% |
| halichoeres | 2861 | 0.4% |
| synodus | 2680 | 0.3% |
| acanthurus | 2550 | 0.3% |
| Other values (15283) | 733894 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 711716 | 9.3% |
| a | 681062 | 8.9% |
| i | 624744 | 8.2% |
| o | 551041 | 7.2% |
| u | 497264 | 6.5% |
| e | 491012 | 6.4% |
| r | 459068 | 6.0% |
| t | 435608 | 5.7% |
| n | 408570 | 5.4% |
| 384952 | 5.1% | |
| Other values (44) | 2375527 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6850578 | |
| Space Separator | 384952 | 5.1% |
| Uppercase Letter | 384952 | 5.1% |
| Dash Punctuation | 82 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 711716 | |
| a | 681062 | |
| i | 624744 | 9.1% |
| o | 551041 | 8.0% |
| u | 497264 | 7.3% |
| e | 491012 | 7.2% |
| r | 459068 | 6.7% |
| t | 435608 | 6.4% |
| n | 408570 | 6.0% |
| l | 337102 | 4.9% |
| Other values (16) | 1653391 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 55562 | |
| P | 46179 | |
| S | 43666 | |
| A | 35670 | |
| E | 25750 | 6.7% |
| L | 24627 | 6.4% |
| H | 21929 | 5.7% |
| M | 21577 | 5.6% |
| N | 17672 | 4.6% |
| G | 15987 | 4.2% |
| Other values (16) | 76333 |
Space Separator
| Value | Count | Frequency (%) |
| 384952 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 82 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7235530 | |
| Common | 385034 | 5.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 711716 | 9.8% |
| a | 681062 | 9.4% |
| i | 624744 | 8.6% |
| o | 551041 | 7.6% |
| u | 497264 | 6.9% |
| e | 491012 | 6.8% |
| r | 459068 | 6.3% |
| t | 435608 | 6.0% |
| n | 408570 | 5.6% |
| l | 337102 | 4.7% |
| Other values (42) | 2038343 |
Common
| Value | Count | Frequency (%) |
| 384952 | ||
| - | 82 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7620564 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 711716 | 9.3% |
| a | 681062 | 8.9% |
| i | 624744 | 8.2% |
| o | 551041 | 7.2% |
| u | 497264 | 6.5% |
| e | 491012 | 6.4% |
| r | 459068 | 6.0% |
| t | 435608 | 5.7% |
| n | 408570 | 5.4% |
| 384952 | 5.1% | |
| Other values (44) | 2375527 |
| Distinct | 22054 |
|---|---|
| Distinct (%) | 4.8% |
| Missing | 211 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 111 |
|---|---|
| Median length | 88 |
| Mean length | 34.4021156 |
| Min length | 7 |
Unique
| Unique | 4768 ? |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | Echidna nebulosa (Ahl, 1789) |
|---|---|
| 2nd row | Mugil Linnaeus, 1758 |
| 3rd row | Myersina filifer (Valenciennes, 1837) |
| 4th row | Rhinichthys cataractae (Valenciennes, 1842) |
| 5th row | Centropomus ensiferus Poey, 1860 |
| Value | Count | Frequency (%) |
| 73674 | 4.0% | |
| linnaeus | 27502 | 1.5% |
| bleeker | 24276 | 1.3% |
| 1758 | 21716 | 1.2% |
| valenciennes | 20911 | 1.1% |
| cuvier | 18805 | 1.0% |
| bloch | 16076 | 0.9% |
| jordan | 16047 | 0.9% |
| lacepède | 14474 | 0.8% |
| günther | 13679 | 0.7% |
| Other values (18045) | 1608922 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1401081 | 9.0% | |
| e | 1049108 | 6.7% |
| a | 1030592 | 6.6% |
| i | 924453 | 5.9% |
| s | 912612 | 5.8% |
| n | 776988 | 5.0% |
| r | 763435 | 4.9% |
| o | 760269 | 4.9% |
| u | 639070 | 4.1% |
| l | 589065 | 3.8% |
| Other values (80) | 6806324 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10559704 | |
| Decimal Number | 1720212 | 11.0% |
| Space Separator | 1401081 | 9.0% |
| Uppercase Letter | 968743 | 6.2% |
| Other Punctuation | 507195 | 3.2% |
| Open Punctuation | 246790 | 1.6% |
| Close Punctuation | 246790 | 1.6% |
| Dash Punctuation | 2482 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1049108 | |
| a | 1030592 | |
| i | 924453 | 8.8% |
| s | 912612 | 8.6% |
| n | 776988 | 7.4% |
| r | 763435 | 7.2% |
| o | 760269 | 7.2% |
| u | 639070 | 6.1% |
| l | 589065 | 5.6% |
| t | 574724 | 5.4% |
| Other values (34) | 2539388 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 101740 | |
| S | 98482 | |
| B | 89774 | 9.3% |
| L | 87106 | 9.0% |
| G | 84761 | 8.7% |
| P | 66896 | 6.9% |
| A | 53025 | 5.5% |
| R | 50566 | 5.2% |
| M | 48486 | 5.0% |
| E | 42815 | 4.4% |
| Other values (18) | 245092 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 503738 | |
| 8 | 362257 | |
| 9 | 175589 | 10.2% |
| 7 | 134109 | 7.8% |
| 5 | 112293 | 6.5% |
| 0 | 107320 | 6.2% |
| 2 | 89202 | 5.2% |
| 6 | 84607 | 4.9% |
| 3 | 84274 | 4.9% |
| 4 | 66823 | 3.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 433183 | |
| & | 73674 | 14.5% |
| . | 263 | 0.1% |
| ' | 75 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1401081 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 246790 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 246790 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2482 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11528447 | |
| Common | 4124550 | 26.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1049108 | 9.1% |
| a | 1030592 | 8.9% |
| i | 924453 | 8.0% |
| s | 912612 | 7.9% |
| n | 776988 | 6.7% |
| r | 763435 | 6.6% |
| o | 760269 | 6.6% |
| u | 639070 | 5.5% |
| l | 589065 | 5.1% |
| t | 574724 | 5.0% |
| Other values (62) | 3508131 |
Common
| Value | Count | Frequency (%) |
| 1401081 | ||
| 1 | 503738 | 12.2% |
| , | 433183 | 10.5% |
| 8 | 362257 | 8.8% |
| ( | 246790 | 6.0% |
| ) | 246790 | 6.0% |
| 9 | 175589 | 4.3% |
| 7 | 134109 | 3.3% |
| 5 | 112293 | 2.7% |
| 0 | 107320 | 2.6% |
| Other values (8) | 401400 | 9.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15595063 | |
| None | 57934 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1401081 | 9.0% | |
| e | 1049108 | 6.7% |
| a | 1030592 | 6.6% |
| i | 924453 | 5.9% |
| s | 912612 | 5.9% |
| n | 776988 | 5.0% |
| r | 763435 | 4.9% |
| o | 760269 | 4.9% |
| u | 639070 | 4.1% |
| l | 589065 | 3.8% |
| Other values (60) | 6748390 |
None
| Value | Count | Frequency (%) |
| ü | 25216 | |
| è | 14495 | |
| å | 11815 | |
| ö | 2996 | 5.2% |
| é | 2103 | 3.6% |
| ø | 575 | 1.0% |
| á | 277 | 0.5% |
| ó | 158 | 0.3% |
| ă | 111 | 0.2% |
| ç | 56 | 0.1% |
| Other values (10) | 132 | 0.2% |
| Distinct | 30202 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 14 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 69 |
|---|---|
| Median length | 54 |
| Mean length | 18.57410402 |
| Min length | 2 |
Unique
| Unique | 9327 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | Echidna nebulosa |
|---|---|
| 2nd row | Mugil |
| 3rd row | Cryptocentrus filifer |
| 4th row | Rhinichthys cataractae |
| 5th row | Centropomus ensiferus |
| Value | Count | Frequency (%) |
| notropis | 7207 | 0.8% |
| etheostoma | 4890 | 0.6% |
| chaetodon | 4339 | 0.5% |
| gymnothorax | 4324 | 0.5% |
| lepomis | 4273 | 0.5% |
| lutjanus | 3888 | 0.5% |
| chromis | 3151 | 0.4% |
| halichoeres | 3126 | 0.4% |
| pomacentrus | 2957 | 0.3% |
| acanthurus | 2893 | 0.3% |
| Other values (18882) | 813929 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 773325 | 9.1% |
| a | 769363 | 9.1% |
| i | 700851 | 8.3% |
| o | 618984 | 7.3% |
| e | 559438 | 6.6% |
| u | 535543 | 6.3% |
| r | 508516 | 6.0% |
| t | 479216 | 5.7% |
| n | 446288 | 5.3% |
| 399779 | 4.7% | |
| Other values (60) | 2663592 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7598796 | |
| Uppercase Letter | 455383 | 5.4% |
| Space Separator | 399779 | 4.7% |
| Close Punctuation | 292 | < 0.1% |
| Open Punctuation | 292 | < 0.1% |
| Other Punctuation | 152 | < 0.1% |
| Dash Punctuation | 111 | < 0.1% |
| Decimal Number | 90 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 773325 | |
| a | 769363 | |
| i | 700851 | 9.2% |
| o | 618984 | 8.1% |
| e | 559438 | 7.4% |
| u | 535543 | 7.0% |
| r | 508516 | 6.7% |
| t | 479216 | 6.3% |
| n | 446288 | 5.9% |
| l | 365536 | 4.8% |
| Other values (16) | 1841736 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 63044 | |
| P | 53484 | |
| S | 52488 | |
| A | 43659 | |
| E | 29489 | 6.5% |
| L | 27979 | 6.1% |
| M | 27366 | 6.0% |
| H | 25637 | 5.6% |
| G | 21345 | 4.7% |
| N | 21144 | 4.6% |
| Other values (16) | 89748 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 39 | |
| 2 | 25 | |
| 3 | 15 | 16.7% |
| 4 | 3 | 3.3% |
| 6 | 3 | 3.3% |
| 9 | 2 | 2.2% |
| 5 | 2 | 2.2% |
| 7 | 1 | 1.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 84 | |
| / | 41 | |
| ? | 11 | 7.2% |
| & | 8 | 5.3% |
| † | 5 | 3.3% |
| # | 3 | 2.0% |
Space Separator
| Value | Count | Frequency (%) |
| 399779 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 292 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 292 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 111 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8054179 | |
| Common | 400716 | 4.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 773325 | 9.6% |
| a | 769363 | 9.6% |
| i | 700851 | 8.7% |
| o | 618984 | 7.7% |
| e | 559438 | 6.9% |
| u | 535543 | 6.6% |
| r | 508516 | 6.3% |
| t | 479216 | 5.9% |
| n | 446288 | 5.5% |
| l | 365536 | 4.5% |
| Other values (42) | 2297119 |
Common
| Value | Count | Frequency (%) |
| 399779 | ||
| ) | 292 | 0.1% |
| ( | 292 | 0.1% |
| - | 111 | < 0.1% |
| . | 84 | < 0.1% |
| / | 41 | < 0.1% |
| 1 | 39 | < 0.1% |
| 2 | 25 | < 0.1% |
| 3 | 15 | < 0.1% |
| ? | 11 | < 0.1% |
| Other values (8) | 27 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8454890 | |
| Punctuation | 5 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 773325 | 9.1% |
| a | 769363 | 9.1% |
| i | 700851 | 8.3% |
| o | 618984 | 7.3% |
| e | 559438 | 6.6% |
| u | 535543 | 6.3% |
| r | 508516 | 6.0% |
| t | 479216 | 5.7% |
| n | 446288 | 5.3% |
| 399779 | 4.7% | |
| Other values (59) | 2663587 |
Punctuation
| Value | Count | Frequency (%) |
| † | 5 |
protocol
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | EML |
| 3rd row | EML |
| 4th row | EML |
| 5th row | EML |
| Value | Count | Frequency (%) |
| eml | 455205 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 455205 | |
| M | 455205 | |
| L | 455205 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1365615 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 455205 | |
| M | 455205 | |
| L | 455205 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1365615 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 455205 | |
| M | 455205 | |
| L | 455205 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1365615 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 455205 | |
| M | 455205 | |
| L | 455205 |
lastParsed
Text
| Distinct | 173323 |
|---|---|
| Distinct (%) | 38.1% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99588757 |
| Min length | 20 |
Unique
| Unique | 50645 ? |
|---|---|
| Unique (%) | 11.1% |
Sample
| 1st row | 2024-12-02T13:56:09.099Z |
|---|---|
| 2nd row | 2024-12-02T13:56:08.596Z |
| 3rd row | 2024-12-02T13:59:51.375Z |
| 4th row | 2024-12-02T13:58:24.571Z |
| 5th row | 2024-12-02T13:56:08.212Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:53.333z | 14 | < 0.1% |
| 2024-12-02t13:57:01.873z | 14 | < 0.1% |
| 2024-12-02t13:57:03.178z | 13 | < 0.1% |
| 2024-12-02t13:57:41.128z | 13 | < 0.1% |
| 2024-12-02t13:57:28.109z | 13 | < 0.1% |
| 2024-12-02t13:57:52.916z | 13 | < 0.1% |
| 2024-12-02t13:57:04.016z | 13 | < 0.1% |
| 2024-12-02t13:57:30.416z | 13 | < 0.1% |
| 2024-12-02t13:58:01.465z | 13 | < 0.1% |
| 2024-12-02t13:57:21.641z | 12 | < 0.1% |
| Other values (173313) | 455074 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2078433 | |
| 0 | 1154714 | |
| 1 | 1148157 | |
| - | 910410 | |
| : | 910410 | |
| 4 | 731701 | 6.7% |
| 5 | 722652 | 6.6% |
| 3 | 721318 | 6.6% |
| T | 455205 | 4.2% |
| Z | 455205 | 4.2% |
| Other values (5) | 1634843 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7737081 | |
| Other Punctuation | 1365147 | 12.5% |
| Dash Punctuation | 910410 | 8.3% |
| Uppercase Letter | 910410 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2078433 | |
| 0 | 1154714 | |
| 1 | 1148157 | |
| 4 | 731701 | 9.5% |
| 5 | 722652 | 9.3% |
| 3 | 721318 | 9.3% |
| 7 | 349813 | 4.5% |
| 9 | 291367 | 3.8% |
| 6 | 275056 | 3.6% |
| 8 | 263870 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 910410 | |
| . | 454737 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 455205 | |
| Z | 455205 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 910410 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10012638 | |
| Latin | 910410 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2078433 | |
| 0 | 1154714 | |
| 1 | 1148157 | |
| - | 910410 | |
| : | 910410 | |
| 4 | 731701 | 7.3% |
| 5 | 722652 | 7.2% |
| 3 | 721318 | 7.2% |
| . | 454737 | 4.5% |
| 7 | 349813 | 3.5% |
| Other values (3) | 830293 | 8.3% |
Latin
| Value | Count | Frequency (%) |
| T | 455205 | |
| Z | 455205 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10923048 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2078433 | |
| 0 | 1154714 | |
| 1 | 1148157 | |
| - | 910410 | |
| : | 910410 | |
| 4 | 731701 | 6.7% |
| 5 | 722652 | 6.6% |
| 3 | 721318 | 6.6% |
| T | 455205 | 4.2% |
| Z | 455205 | 4.2% |
| Other values (5) | 1634843 |
lastCrawled
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2024-12-02T11:48:23.416Z |
|---|---|
| 2nd row | 2024-12-02T11:48:23.416Z |
| 3rd row | 2024-12-02T11:48:23.416Z |
| 4th row | 2024-12-02T11:48:23.416Z |
| 5th row | 2024-12-02T11:48:23.416Z |
| Value | Count | Frequency (%) |
| 2024-12-02t11:48:23.416z | 455205 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2276025 | |
| 1 | 1820820 | |
| 4 | 1365615 | |
| 0 | 910410 | 8.3% |
| - | 910410 | 8.3% |
| : | 910410 | 8.3% |
| T | 455205 | 4.2% |
| 8 | 455205 | 4.2% |
| 3 | 455205 | 4.2% |
| . | 455205 | 4.2% |
| Other values (2) | 910410 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7738485 | |
| Other Punctuation | 1365615 | 12.5% |
| Dash Punctuation | 910410 | 8.3% |
| Uppercase Letter | 910410 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2276025 | |
| 1 | 1820820 | |
| 4 | 1365615 | |
| 0 | 910410 | 11.8% |
| 8 | 455205 | 5.9% |
| 3 | 455205 | 5.9% |
| 6 | 455205 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 910410 | |
| . | 455205 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 455205 | |
| Z | 455205 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 910410 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10014510 | |
| Latin | 910410 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2276025 | |
| 1 | 1820820 | |
| 4 | 1365615 | |
| 0 | 910410 | 9.1% |
| - | 910410 | 9.1% |
| : | 910410 | 9.1% |
| 8 | 455205 | 4.5% |
| 3 | 455205 | 4.5% |
| . | 455205 | 4.5% |
| 6 | 455205 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| T | 455205 | |
| Z | 455205 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10924920 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2276025 | |
| 1 | 1820820 | |
| 4 | 1365615 | |
| 0 | 910410 | 8.3% |
| - | 910410 | 8.3% |
| : | 910410 | 8.3% |
| T | 455205 | 4.2% |
| 8 | 455205 | 4.2% |
| 3 | 455205 | 4.2% |
| . | 455205 | 4.2% |
| Other values (2) | 910410 | 8.3% |
repatriated
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 30397 |
| Missing (%) | 6.7% |
| Memory size | 3.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.293219401 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | true |
| 4th row | false |
| 5th row | true |
| Value | Count | Frequency (%) |
| true | 300251 | |
| false | 124564 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 424815 | |
| t | 300251 | |
| r | 300251 | |
| u | 300251 | |
| f | 124564 | 6.8% |
| a | 124564 | 6.8% |
| l | 124564 | 6.8% |
| s | 124564 | 6.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1823824 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 424815 | |
| t | 300251 | |
| r | 300251 | |
| u | 300251 | |
| f | 124564 | 6.8% |
| a | 124564 | 6.8% |
| l | 124564 | 6.8% |
| s | 124564 | 6.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1823824 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 424815 | |
| t | 300251 | |
| r | 300251 | |
| u | 300251 | |
| f | 124564 | 6.8% |
| a | 124564 | 6.8% |
| l | 124564 | 6.8% |
| s | 124564 | 6.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1823824 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 424815 | |
| t | 300251 | |
| r | 300251 | |
| u | 300251 | |
| f | 124564 | 6.8% |
| a | 124564 | 6.8% |
| l | 124564 | 6.8% |
| s | 124564 | 6.8% |
isSequenced
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.999011434 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 454755 | |
| true | 450 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 455205 | |
| f | 454755 | |
| a | 454755 | |
| l | 454755 | |
| s | 454755 | |
| t | 450 | < 0.1% |
| r | 450 | < 0.1% |
| u | 450 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2275575 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 455205 | |
| f | 454755 | |
| a | 454755 | |
| l | 454755 | |
| s | 454755 | |
| t | 450 | < 0.1% |
| r | 450 | < 0.1% |
| u | 450 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2275575 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 455205 | |
| f | 454755 | |
| a | 454755 | |
| l | 454755 | |
| s | 454755 | |
| t | 450 | < 0.1% |
| r | 450 | < 0.1% |
| u | 450 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2275575 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 455205 | |
| f | 454755 | |
| a | 454755 | |
| l | 454755 | |
| s | 454755 | |
| t | 450 | < 0.1% |
| r | 450 | < 0.1% |
| u | 450 | < 0.1% |
gbifRegion
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 32195 |
| Missing (%) | 7.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 9.506102592 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | ASIA |
| 4th row | NORTH_AMERICA |
| 5th row | LATIN_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 127654 | |
| latin_america | 100745 | |
| asia | 94416 | |
| oceania | 68048 | |
| africa | 24873 | 5.9% |
| europe | 5998 | 1.4% |
| antarctica | 1283 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 936066 | |
| I | 517764 | |
| R | 388207 | |
| C | 323886 | 8.1% |
| E | 308443 | 7.7% |
| N | 297730 | 7.4% |
| T | 230965 | 5.7% |
| _ | 228399 | 5.7% |
| M | 228399 | 5.7% |
| O | 201700 | 5.0% |
| Other values (6) | 359684 | 8.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3792844 | |
| Connector Punctuation | 228399 | 5.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 936066 | |
| I | 517764 | |
| R | 388207 | |
| C | 323886 | 8.5% |
| E | 308443 | 8.1% |
| N | 297730 | 7.8% |
| T | 230965 | 6.1% |
| M | 228399 | 6.0% |
| O | 201700 | 5.3% |
| H | 127654 | 3.4% |
| Other values (5) | 232030 | 6.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 228399 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3792844 | |
| Common | 228399 | 5.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 936066 | |
| I | 517764 | |
| R | 388207 | |
| C | 323886 | 8.5% |
| E | 308443 | 8.1% |
| N | 297730 | 7.8% |
| T | 230965 | 6.1% |
| M | 228399 | 6.0% |
| O | 201700 | 5.3% |
| H | 127654 | 3.4% |
| Other values (5) | 232030 | 6.1% |
Common
| Value | Count | Frequency (%) |
| _ | 228399 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4021243 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 936066 | |
| I | 517764 | |
| R | 388207 | |
| C | 323886 | 8.1% |
| E | 308443 | 7.7% |
| N | 297730 | 7.4% |
| T | 230965 | 5.7% |
| _ | 228399 | 5.7% |
| M | 228399 | 5.7% |
| O | 201700 | 5.0% |
| Other values (6) | 359684 | 8.9% |
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7 |
| Missing (%) | < 0.1% |
| Memory size | 3.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 455205 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 910410 | |
| A | 910410 | |
| N | 455205 | |
| O | 455205 | |
| T | 455205 | |
| H | 455205 | |
| _ | 455205 | |
| M | 455205 | |
| E | 455205 | |
| I | 455205 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 5462460 | |
| Connector Punctuation | 455205 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 910410 | |
| A | 910410 | |
| N | 455205 | |
| O | 455205 | |
| T | 455205 | |
| H | 455205 | |
| M | 455205 | |
| E | 455205 | |
| I | 455205 | |
| C | 455205 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 455205 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5462460 | |
| Common | 455205 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 910410 | |
| A | 910410 | |
| N | 455205 | |
| O | 455205 | |
| T | 455205 | |
| H | 455205 | |
| M | 455205 | |
| E | 455205 | |
| I | 455205 | |
| C | 455205 |
Common
| Value | Count | Frequency (%) |
| _ | 455205 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5917665 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 910410 | |
| A | 910410 | |
| N | 455205 | |
| O | 455205 | |
| T | 455205 | |
| H | 455205 | |
| _ | 455205 | |
| M | 455205 | |
| E | 455205 | |
| I | 455205 |
level0Gid
Text
Missing 
| Distinct | 139 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 407295 |
| Missing (%) | 89.5% |
| Memory size | 3.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 20 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | TON |
|---|---|
| 2nd row | PHL |
| 3rd row | BRA |
| 4th row | PAN |
| 5th row | IDN |
| Value | Count | Frequency (%) |
| usa | 11507 | |
| phl | 5147 | 10.7% |
| ven | 2888 | 6.0% |
| bra | 2826 | 5.9% |
| idn | 2406 | 5.0% |
| fji | 2133 | 4.5% |
| sur | 1264 | 2.6% |
| per | 1237 | 2.6% |
| png | 1100 | 2.3% |
| slb | 1025 | 2.1% |
| Other values (129) | 16384 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 17643 | |
| S | 16829 | |
| U | 15233 | 10.6% |
| N | 10380 | 7.2% |
| P | 9498 | 6.6% |
| L | 8389 | 5.8% |
| R | 7899 | 5.5% |
| H | 6313 | 4.4% |
| I | 5803 | 4.0% |
| E | 5461 | 3.8% |
| Other values (16) | 40303 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 143751 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 17643 | |
| S | 16829 | |
| U | 15233 | 10.6% |
| N | 10380 | 7.2% |
| P | 9498 | 6.6% |
| L | 8389 | 5.8% |
| R | 7899 | 5.5% |
| H | 6313 | 4.4% |
| I | 5803 | 4.0% |
| E | 5461 | 3.8% |
| Other values (16) | 40303 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 143751 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 17643 | |
| S | 16829 | |
| U | 15233 | 10.6% |
| N | 10380 | 7.2% |
| P | 9498 | 6.6% |
| L | 8389 | 5.8% |
| R | 7899 | 5.5% |
| H | 6313 | 4.4% |
| I | 5803 | 4.0% |
| E | 5461 | 3.8% |
| Other values (16) | 40303 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 143751 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 17643 | |
| S | 16829 | |
| U | 15233 | 10.6% |
| N | 10380 | 7.2% |
| P | 9498 | 6.6% |
| L | 8389 | 5.8% |
| R | 7899 | 5.5% |
| H | 6313 | 4.4% |
| I | 5803 | 4.0% |
| E | 5461 | 3.8% |
| Other values (16) | 40303 |
level0Name
Text
Missing 
| Distinct | 139 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 407295 |
| Missing (%) | 89.5% |
| Memory size | 3.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 24 |
| Mean length | 10.19049607 |
| Min length | 4 |
Unique
| Unique | 20 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Tonga |
|---|---|
| 2nd row | Philippines |
| 3rd row | Brazil |
| 4th row | Panama |
| 5th row | Indonesia |
| Value | Count | Frequency (%) |
| united | 11597 | |
| states | 11597 | |
| philippines | 5147 | 7.4% |
| venezuela | 2888 | 4.2% |
| brazil | 2826 | 4.1% |
| indonesia | 2406 | 3.5% |
| fiji | 2133 | 3.1% |
| new | 1388 | 2.0% |
| and | 1366 | 2.0% |
| islands | 1352 | 1.9% |
| Other values (173) | 26850 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 53763 | 11.0% |
| i | 52703 | 10.8% |
| a | 51807 | 10.6% |
| n | 41316 | 8.5% |
| t | 38500 | 7.9% |
| s | 27341 | 5.6% |
| 21633 | 4.4% | |
| d | 20330 | 4.2% |
| l | 20222 | 4.1% |
| S | 15363 | 3.1% |
| Other values (51) | 145320 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 398596 | |
| Uppercase Letter | 68018 | 13.9% |
| Space Separator | 21633 | 4.4% |
| Other Punctuation | 40 | < 0.1% |
| Open Punctuation | 5 | < 0.1% |
| Close Punctuation | 5 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 53763 | |
| i | 52703 | |
| a | 51807 | |
| n | 41316 | |
| t | 38500 | |
| s | 27341 | |
| d | 20330 | 5.1% |
| l | 20222 | 5.1% |
| o | 15006 | 3.8% |
| r | 14531 | 3.6% |
| Other values (21) | 63077 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 15363 | |
| U | 11602 | |
| P | 9290 | |
| B | 4522 | 6.6% |
| I | 4345 | 6.4% |
| T | 3836 | 5.6% |
| V | 3518 | 5.2% |
| C | 3155 | 4.6% |
| F | 3136 | 4.6% |
| M | 2238 | 3.3% |
| Other values (14) | 7013 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 38 | |
| ' | 2 | 5.0% |
Space Separator
| Value | Count | Frequency (%) |
| 21633 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 466614 | |
| Common | 21684 | 4.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 53763 | |
| i | 52703 | |
| a | 51807 | 11.1% |
| n | 41316 | 8.9% |
| t | 38500 | 8.3% |
| s | 27341 | 5.9% |
| d | 20330 | 4.4% |
| l | 20222 | 4.3% |
| S | 15363 | 3.3% |
| o | 15006 | 3.2% |
| Other values (45) | 130263 |
Common
| Value | Count | Frequency (%) |
| 21633 | ||
| , | 38 | 0.2% |
| ( | 5 | < 0.1% |
| ) | 5 | < 0.1% |
| ' | 2 | < 0.1% |
| - | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 487641 | |
| None | 657 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 53763 | 11.0% |
| i | 52703 | 10.8% |
| a | 51807 | 10.6% |
| n | 41316 | 8.5% |
| t | 38500 | 7.9% |
| s | 27341 | 5.6% |
| 21633 | 4.4% | |
| d | 20330 | 4.2% |
| l | 20222 | 4.1% |
| S | 15363 | 3.2% |
| Other values (46) | 144663 |
None
| Value | Count | Frequency (%) |
| ç | 503 | |
| é | 150 | 22.8% |
| ô | 2 | 0.3% |
| ã | 1 | 0.2% |
| í | 1 | 0.2% |
level1Gid
Text
Missing 
| Distinct | 629 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 408402 |
| Missing (%) | 89.7% |
| Memory size | 3.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.589168981 |
| Min length | 6 |
Unique
| Unique | 134 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | TON.5_1 |
|---|---|
| 2nd row | PHL.52_1 |
| 3rd row | BRA.13_1 |
| 4th row | PAN.5_1 |
| 5th row | IDN.12_1 |
| Value | Count | Frequency (%) |
| usa.47_1 | 2114 | 4.5% |
| usa.39_1 | 1909 | 4.1% |
| usa.21_1 | 1178 | 2.5% |
| fji.4_1 | 1089 | 2.3% |
| phl.52_1 | 1010 | 2.2% |
| sur.9_1 | 986 | 2.1% |
| bra.4_1 | 986 | 2.1% |
| usa.49_1 | 966 | 2.1% |
| fji.2_1 | 918 | 2.0% |
| idn.19_1 | 915 | 2.0% |
| Other values (619) | 34739 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 61632 | |
| _ | 46806 | |
| . | 46779 | |
| A | 17604 | 5.0% |
| S | 16787 | 4.7% |
| U | 14730 | 4.1% |
| 2 | 12539 | 3.5% |
| 4 | 10903 | 3.1% |
| N | 10380 | 2.9% |
| 3 | 9530 | 2.7% |
| Other values (28) | 107559 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 140442 | |
| Decimal Number | 121222 | |
| Connector Punctuation | 46806 | 13.2% |
| Other Punctuation | 46779 | 13.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 17604 | |
| S | 16787 | |
| U | 14730 | 10.5% |
| N | 10380 | 7.4% |
| P | 9498 | 6.8% |
| L | 8367 | 6.0% |
| R | 7692 | 5.5% |
| H | 6317 | 4.5% |
| E | 5461 | 3.9% |
| B | 5444 | 3.9% |
| Other values (16) | 38162 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 61632 | |
| 2 | 12539 | 10.3% |
| 4 | 10903 | 9.0% |
| 3 | 9530 | 7.9% |
| 9 | 8556 | 7.1% |
| 5 | 5584 | 4.6% |
| 7 | 5302 | 4.4% |
| 6 | 2955 | 2.4% |
| 8 | 2789 | 2.3% |
| 0 | 1432 | 1.2% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 46806 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 46779 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 214807 | |
| Latin | 140442 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 17604 | |
| S | 16787 | |
| U | 14730 | 10.5% |
| N | 10380 | 7.4% |
| P | 9498 | 6.8% |
| L | 8367 | 6.0% |
| R | 7692 | 5.5% |
| H | 6317 | 4.5% |
| E | 5461 | 3.9% |
| B | 5444 | 3.9% |
| Other values (16) | 38162 |
Common
| Value | Count | Frequency (%) |
| 1 | 61632 | |
| _ | 46806 | |
| . | 46779 | |
| 2 | 12539 | 5.8% |
| 4 | 10903 | 5.1% |
| 3 | 9530 | 4.4% |
| 9 | 8556 | 4.0% |
| 5 | 5584 | 2.6% |
| 7 | 5302 | 2.5% |
| 6 | 2955 | 1.4% |
| Other values (2) | 4221 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 355249 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 61632 | |
| _ | 46806 | |
| . | 46779 | |
| A | 17604 | 5.0% |
| S | 16787 | 4.7% |
| U | 14730 | 4.1% |
| 2 | 12539 | 3.5% |
| 4 | 10903 | 3.1% |
| N | 10380 | 2.9% |
| 3 | 9530 | 2.7% |
| Other values (28) | 107559 |
level1Name
Text
Missing 
| Distinct | 611 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 408402 |
| Missing (%) | 89.7% |
| Memory size | 3.5 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 24 |
| Mean length | 9.236039308 |
| Min length | 3 |
Unique
| Unique | 130 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Vava'u |
|---|---|
| 2nd row | Negros Oriental |
| 3rd row | Minas Gerais |
| 4th row | Darién |
| 5th row | Kalimantan Barat |
| Value | Count | Frequency (%) |
| virginia | 3080 | 4.8% |
| pennsylvania | 1909 | 3.0% |
| amazonas | 1611 | 2.5% |
| maryland | 1178 | 1.8% |
| south | 1090 | 1.7% |
| rotuma | 1089 | 1.7% |
| islands | 1083 | 1.7% |
| oriental | 1050 | 1.6% |
| negros | 1025 | 1.6% |
| eastern | 1020 | 1.6% |
| Other values (674) | 49891 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 64833 | |
| n | 33919 | 7.8% |
| i | 32416 | 7.5% |
| r | 26649 | 6.2% |
| o | 26487 | 6.1% |
| e | 26064 | 6.0% |
| s | 19671 | 4.5% |
| t | 19496 | 4.5% |
| l | 19259 | 4.5% |
| 17216 | 4.0% | |
| Other values (72) | 146329 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 350132 | |
| Uppercase Letter | 63320 | 14.6% |
| Space Separator | 17216 | 4.0% |
| Dash Punctuation | 1031 | 0.2% |
| Other Punctuation | 564 | 0.1% |
| Modifier Symbol | 48 | < 0.1% |
| Open Punctuation | 14 | < 0.1% |
| Close Punctuation | 14 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 64833 | |
| n | 33919 | |
| i | 32416 | |
| r | 26649 | |
| o | 26487 | 7.6% |
| e | 26064 | 7.4% |
| s | 19671 | 5.6% |
| t | 19496 | 5.6% |
| l | 19259 | 5.5% |
| u | 14915 | 4.3% |
| Other values (34) | 66423 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 7197 | 11.4% |
| M | 5791 | 9.1% |
| A | 5062 | 8.0% |
| B | 4023 | 6.4% |
| V | 3938 | 6.2% |
| T | 3895 | 6.2% |
| P | 3802 | 6.0% |
| N | 3754 | 5.9% |
| C | 3733 | 5.9% |
| O | 3489 | 5.5% |
| Other values (19) | 18636 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 552 | |
| / | 12 | 2.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 11 | |
| ( | 3 | 21.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 11 | |
| ) | 3 | 21.4% |
Space Separator
| Value | Count | Frequency (%) |
| 17216 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1031 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 48 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 413452 | |
| Common | 18887 | 4.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 64833 | |
| n | 33919 | 8.2% |
| i | 32416 | 7.8% |
| r | 26649 | 6.4% |
| o | 26487 | 6.4% |
| e | 26064 | 6.3% |
| s | 19671 | 4.8% |
| t | 19496 | 4.7% |
| l | 19259 | 4.7% |
| u | 14915 | 3.6% |
| Other values (63) | 129743 |
Common
| Value | Count | Frequency (%) |
| 17216 | ||
| - | 1031 | 5.5% |
| ' | 552 | 2.9% |
| ` | 48 | 0.3% |
| / | 12 | 0.1% |
| [ | 11 | 0.1% |
| ] | 11 | 0.1% |
| ( | 3 | < 0.1% |
| ) | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 427954 | |
| None | 4316 | 1.0% |
| Latin Ext Additional | 69 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 64833 | |
| n | 33919 | 7.9% |
| i | 32416 | 7.6% |
| r | 26649 | 6.2% |
| o | 26487 | 6.2% |
| e | 26064 | 6.1% |
| s | 19671 | 4.6% |
| t | 19496 | 4.6% |
| l | 19259 | 4.5% |
| 17216 | 4.0% | |
| Other values (51) | 141944 |
None
| Value | Count | Frequency (%) |
| á | 1512 | |
| Î | 928 | |
| é | 860 | |
| í | 392 | 9.1% |
| ó | 256 | 5.9% |
| ã | 136 | 3.2% |
| ò | 73 | 1.7% |
| ì | 68 | 1.6% |
| ö | 33 | 0.8% |
| ț | 14 | 0.3% |
| Other values (9) | 44 | 1.0% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ậ | 68 | |
| Ḥ | 1 | 1.4% |
level2Gid
Text
Missing 
| Distinct | 1834 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 412023 |
| Missing (%) | 90.5% |
| Memory size | 3.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 10.0572368 |
| Min length | 7 |
Unique
| Unique | 435 ? |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | TON.5.0_1 |
|---|---|
| 2nd row | PHL.52.17_1 |
| 3rd row | BRA.13.511_2 |
| 4th row | PAN.5.2_1 |
| 5th row | IDN.12.14_1 |
| Value | Count | Frequency (%) |
| fji.4.1_1 | 1089 | 2.5% |
| sur.9.5_1 | 722 | 1.7% |
| fji.2.2_1 | 640 | 1.5% |
| slb.7.26_1 | 534 | 1.2% |
| ton.5.0_1 | 471 | 1.1% |
| idn.19.1_1 | 469 | 1.1% |
| ven.9.1_1 | 455 | 1.1% |
| per.17.4_1 | 448 | 1.0% |
| idn.19.6_1 | 444 | 1.0% |
| idn.28.2_1 | 443 | 1.0% |
| Other values (1824) | 37474 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 86343 | |
| 1 | 68426 | |
| _ | 43189 | 9.9% |
| 2 | 25854 | 6.0% |
| A | 17316 | 4.0% |
| 4 | 16359 | 3.8% |
| 3 | 15943 | 3.7% |
| S | 15249 | 3.5% |
| U | 14430 | 3.3% |
| 5 | 11495 | 2.6% |
| Other values (28) | 119758 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 175263 | |
| Uppercase Letter | 129567 | |
| Other Punctuation | 86343 | |
| Connector Punctuation | 43189 | 9.9% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 17316 | |
| S | 15249 | |
| U | 14430 | |
| N | 10350 | 8.0% |
| P | 8606 | 6.6% |
| L | 8302 | 6.4% |
| R | 7644 | 5.9% |
| H | 5908 | 4.6% |
| E | 5459 | 4.2% |
| I | 5168 | 4.0% |
| Other values (16) | 31135 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 68426 | |
| 2 | 25854 | 14.8% |
| 4 | 16359 | 9.3% |
| 3 | 15943 | 9.1% |
| 5 | 11495 | 6.6% |
| 9 | 11103 | 6.3% |
| 7 | 8897 | 5.1% |
| 6 | 7967 | 4.5% |
| 8 | 5561 | 3.2% |
| 0 | 3658 | 2.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 86343 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 43189 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 304795 | |
| Latin | 129567 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 17316 | |
| S | 15249 | |
| U | 14430 | |
| N | 10350 | 8.0% |
| P | 8606 | 6.6% |
| L | 8302 | 6.4% |
| R | 7644 | 5.9% |
| H | 5908 | 4.6% |
| E | 5459 | 4.2% |
| I | 5168 | 4.0% |
| Other values (16) | 31135 |
Common
| Value | Count | Frequency (%) |
| . | 86343 | |
| 1 | 68426 | |
| _ | 43189 | |
| 2 | 25854 | 8.5% |
| 4 | 16359 | 5.4% |
| 3 | 15943 | 5.2% |
| 5 | 11495 | 3.8% |
| 9 | 11103 | 3.6% |
| 7 | 8897 | 2.9% |
| 6 | 7967 | 2.6% |
| Other values (2) | 9219 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 434362 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 86343 | |
| 1 | 68426 | |
| _ | 43189 | 9.9% |
| 2 | 25854 | 6.0% |
| A | 17316 | 4.0% |
| 4 | 16359 | 3.8% |
| 3 | 15943 | 3.7% |
| S | 15249 | 3.5% |
| U | 14430 | 3.3% |
| 5 | 11495 | 2.6% |
| Other values (28) | 119758 |
level2Name
Text
Missing 
| Distinct | 1707 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 412026 |
| Missing (%) | 90.5% |
| Memory size | 3.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 8.320659473 |
| Min length | 3 |
Unique
| Unique | 409 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | n.a. |
|---|---|
| 2nd row | San Jose |
| 3rd row | Nanuque |
| 4th row | Pinogana |
| 5th row | Sintang |
| Value | Count | Frequency (%) |
| city | 2133 | 3.8% |
| rotuma | 1089 | 1.9% |
| kabalebo | 722 | 1.3% |
| san | 695 | 1.2% |
| lau | 640 | 1.1% |
| n.a | 549 | 1.0% |
| sikaiana | 534 | 1.0% |
| tengah | 518 | 0.9% |
| antonio | 476 | 0.8% |
| ambon | 469 | 0.8% |
| Other values (1915) | 48185 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 53679 | 14.9% |
| n | 26896 | 7.5% |
| o | 25683 | 7.1% |
| e | 21825 | 6.1% |
| i | 21332 | 5.9% |
| u | 16906 | 4.7% |
| r | 16255 | 4.5% |
| t | 15400 | 4.3% |
| l | 14249 | 4.0% |
| 12824 | 3.6% | |
| Other values (78) | 134287 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 287322 | |
| Uppercase Letter | 55894 | 15.6% |
| Space Separator | 12824 | 3.6% |
| Other Punctuation | 1664 | 0.5% |
| Dash Punctuation | 1383 | 0.4% |
| Decimal Number | 223 | 0.1% |
| Open Punctuation | 15 | < 0.1% |
| Close Punctuation | 10 | < 0.1% |
| Initial Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 53679 | |
| n | 26896 | |
| o | 25683 | 8.9% |
| e | 21825 | 7.6% |
| i | 21332 | 7.4% |
| u | 16906 | 5.9% |
| r | 16255 | 5.7% |
| t | 15400 | 5.4% |
| l | 14249 | 5.0% |
| s | 9165 | 3.2% |
| Other values (36) | 65932 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 6228 | 11.1% |
| M | 5541 | 9.9% |
| S | 4806 | 8.6% |
| B | 4081 | 7.3% |
| P | 3518 | 6.3% |
| A | 3325 | 5.9% |
| N | 3098 | 5.5% |
| T | 2991 | 5.4% |
| K | 2898 | 5.2% |
| L | 2826 | 5.1% |
| Other values (19) | 16582 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1239 | |
| ' | 412 | 24.8% |
| # | 11 | 0.7% |
| / | 2 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 152 | |
| 0 | 53 | 23.8% |
| 7 | 13 | 5.8% |
| 8 | 5 | 2.2% |
Space Separator
| Value | Count | Frequency (%) |
| 12824 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1383 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 15 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10 |
Initial Punctuation
| Value | Count | Frequency (%) |
| ‹ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 343216 | |
| Common | 16120 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 53679 | |
| n | 26896 | 7.8% |
| o | 25683 | 7.5% |
| e | 21825 | 6.4% |
| i | 21332 | 6.2% |
| u | 16906 | 4.9% |
| r | 16255 | 4.7% |
| t | 15400 | 4.5% |
| l | 14249 | 4.2% |
| s | 9165 | 2.7% |
| Other values (65) | 121826 |
Common
| Value | Count | Frequency (%) |
| 12824 | ||
| - | 1383 | 8.6% |
| . | 1239 | 7.7% |
| ' | 412 | 2.6% |
| 1 | 152 | 0.9% |
| 0 | 53 | 0.3% |
| ( | 15 | 0.1% |
| 7 | 13 | 0.1% |
| # | 11 | 0.1% |
| ) | 10 | 0.1% |
| Other values (3) | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 356398 | |
| None | 2869 | 0.8% |
| Latin Ext Additional | 68 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 53679 | |
| n | 26896 | 7.5% |
| o | 25683 | 7.2% |
| e | 21825 | 6.1% |
| i | 21332 | 6.0% |
| u | 16906 | 4.7% |
| r | 16255 | 4.6% |
| t | 15400 | 4.3% |
| l | 14249 | 4.0% |
| 12824 | 3.6% | |
| Other values (54) | 131349 |
None
| Value | Count | Frequency (%) |
| í | 750 | |
| á | 637 | |
| é | 514 | |
| ã | 232 | 8.1% |
| ó | 225 | 7.8% |
| ñ | 217 | 7.6% |
| ú | 117 | 4.1% |
| ç | 79 | 2.8% |
| ô | 33 | 1.2% |
| Ó | 23 | 0.8% |
| Other values (12) | 42 | 1.5% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ế | 68 |
Punctuation
| Value | Count | Frequency (%) |
| ‹ | 1 |
level3Gid
Text
Missing 
| Distinct | 763 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 441377 |
| Missing (%) | 97.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 12.36754608 |
| Min length | 11 |
Unique
| Unique | 214 ? |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | PHL.52.17.11_1 |
|---|---|
| 2nd row | PAN.5.2.4_1 |
| 3rd row | IDN.12.14.12_1 |
| 4th row | PHL.69.7.31_1 |
| 5th row | CMR.9.6.8_1 |
| Value | Count | Frequency (%) |
| idn.28.2.4_1 | 443 | 3.2% |
| bol.3.8.2_2 | 442 | 3.2% |
| per.18.3.4_1 | 329 | 2.4% |
| per.17.4.4_1 | 312 | 2.3% |
| idn.19.1.3_1 | 266 | 1.9% |
| cmr.9.6.8_1 | 253 | 1.8% |
| cmr.9.4.2_1 | 216 | 1.6% |
| phl.36.37.65_1 | 201 | 1.5% |
| phl.52.25.3_1 | 191 | 1.4% |
| phl.52.17.11_1 | 187 | 1.4% |
| Other values (753) | 10995 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 41505 | |
| 1 | 27915 | |
| _ | 13835 | 8.1% |
| 2 | 11592 | 6.8% |
| P | 7360 | 4.3% |
| 5 | 6289 | 3.7% |
| L | 6013 | 3.5% |
| H | 5824 | 3.4% |
| 3 | 5449 | 3.2% |
| 4 | 5221 | 3.1% |
| Other values (24) | 40102 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 74260 | |
| Other Punctuation | 41505 | |
| Uppercase Letter | 41505 | |
| Connector Punctuation | 13835 | 8.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 7360 | |
| L | 6013 | |
| H | 5824 | |
| N | 3826 | |
| R | 2786 | 6.7% |
| I | 2610 | 6.3% |
| D | 2583 | 6.2% |
| M | 2553 | 6.2% |
| A | 1776 | 4.3% |
| E | 1711 | 4.1% |
| Other values (12) | 4463 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 27915 | |
| 2 | 11592 | |
| 5 | 6289 | 8.5% |
| 3 | 5449 | 7.3% |
| 4 | 5221 | 7.0% |
| 6 | 4843 | 6.5% |
| 9 | 4402 | 5.9% |
| 7 | 3749 | 5.0% |
| 8 | 3548 | 4.8% |
| 0 | 1252 | 1.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 41505 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 13835 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 129600 | |
| Latin | 41505 | 24.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 7360 | |
| L | 6013 | |
| H | 5824 | |
| N | 3826 | |
| R | 2786 | 6.7% |
| I | 2610 | 6.3% |
| D | 2583 | 6.2% |
| M | 2553 | 6.2% |
| A | 1776 | 4.3% |
| E | 1711 | 4.1% |
| Other values (12) | 4463 |
Common
| Value | Count | Frequency (%) |
| . | 41505 | |
| 1 | 27915 | |
| _ | 13835 | 10.7% |
| 2 | 11592 | 8.9% |
| 5 | 6289 | 4.9% |
| 3 | 5449 | 4.2% |
| 4 | 5221 | 4.0% |
| 6 | 4843 | 3.7% |
| 9 | 4402 | 3.4% |
| 7 | 3749 | 2.9% |
| Other values (2) | 4800 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 171105 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 41505 | |
| 1 | 27915 | |
| _ | 13835 | 8.1% |
| 2 | 11592 | 6.8% |
| P | 7360 | 4.3% |
| 5 | 6289 | 3.7% |
| L | 6013 | 3.5% |
| H | 5824 | 3.4% |
| 3 | 5449 | 3.2% |
| 4 | 5221 | 3.1% |
| Other values (24) | 40102 |
level3Name
Text
Missing 
| Distinct | 731 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 441442 |
| Missing (%) | 97.0% |
| Memory size | 3.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 25 |
| Mean length | 9.582207698 |
| Min length | 3 |
Unique
| Unique | 211 ? |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | Señora Ascion |
|---|---|
| 2nd row | Metetí |
| 3rd row | Sintang |
| 4th row | Pinontingan |
| 5th row | Mundemba |
| Value | Count | Frequency (%) |
| santa | 988 | 4.6% |
| poblacion | 541 | 2.5% |
| ana | 515 | 2.4% |
| timur | 501 | 2.4% |
| kabaena | 443 | 2.1% |
| san | 351 | 1.7% |
| tambopata | 329 | 1.5% |
| iquitos | 312 | 1.5% |
| barangay | 304 | 1.4% |
| de | 304 | 1.4% |
| Other values (882) | 16663 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 22534 | |
| n | 11149 | 8.4% |
| o | 9569 | 7.3% |
| 7481 | 5.7% | |
| i | 6726 | 5.1% |
| u | 6243 | 4.7% |
| r | 5659 | 4.3% |
| e | 5013 | 3.8% |
| t | 4320 | 3.3% |
| l | 3979 | 3.0% |
| Other values (81) | 49274 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 101297 | |
| Uppercase Letter | 20490 | 15.5% |
| Space Separator | 7481 | 5.7% |
| Decimal Number | 1246 | 0.9% |
| Other Punctuation | 733 | 0.6% |
| Open Punctuation | 279 | 0.2% |
| Close Punctuation | 272 | 0.2% |
| Dash Punctuation | 149 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 22534 | |
| n | 11149 | |
| o | 9569 | |
| i | 6726 | 6.6% |
| u | 6243 | 6.2% |
| r | 5659 | 5.6% |
| e | 5013 | 4.9% |
| t | 4320 | 4.3% |
| l | 3979 | 3.9% |
| b | 3282 | 3.2% |
| Other values (35) | 22823 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2880 | |
| P | 1817 | 8.9% |
| T | 1788 | 8.7% |
| M | 1587 | 7.7% |
| A | 1506 | 7.3% |
| K | 1428 | 7.0% |
| B | 1222 | 6.0% |
| N | 1175 | 5.7% |
| I | 1107 | 5.4% |
| C | 1072 | 5.2% |
| Other values (17) | 4908 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 334 | |
| 2 | 324 | |
| 1 | 277 | |
| 9 | 119 | 9.6% |
| 8 | 65 | 5.2% |
| 3 | 47 | 3.8% |
| 0 | 42 | 3.4% |
| 5 | 22 | 1.8% |
| 7 | 14 | 1.1% |
| 4 | 2 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 577 | |
| , | 141 | 19.2% |
| " | 8 | 1.1% |
| ' | 6 | 0.8% |
| / | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 7481 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 279 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 272 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 149 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 121787 | |
| Common | 10160 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 22534 | |
| n | 11149 | 9.2% |
| o | 9569 | 7.9% |
| i | 6726 | 5.5% |
| u | 6243 | 5.1% |
| r | 5659 | 4.6% |
| e | 5013 | 4.1% |
| t | 4320 | 3.5% |
| l | 3979 | 3.3% |
| b | 3282 | 2.7% |
| Other values (62) | 43313 |
Common
| Value | Count | Frequency (%) |
| 7481 | ||
| . | 577 | 5.7% |
| 6 | 334 | 3.3% |
| 2 | 324 | 3.2% |
| ( | 279 | 2.7% |
| 1 | 277 | 2.7% |
| ) | 272 | 2.7% |
| - | 149 | 1.5% |
| , | 141 | 1.4% |
| 9 | 119 | 1.2% |
| Other values (9) | 207 | 2.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 130475 | |
| None | 1373 | 1.0% |
| Latin Ext Additional | 99 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 22534 | |
| n | 11149 | 8.5% |
| o | 9569 | 7.3% |
| 7481 | 5.7% | |
| i | 6726 | 5.2% |
| u | 6243 | 4.8% |
| r | 5659 | 4.3% |
| e | 5013 | 3.8% |
| t | 4320 | 3.3% |
| l | 3979 | 3.0% |
| Other values (61) | 47802 |
None
| Value | Count | Frequency (%) |
| í | 518 | |
| ñ | 260 | |
| á | 230 | |
| é | 94 | 6.8% |
| ĩ | 63 | 4.6% |
| ư | 57 | 4.2% |
| ũ | 54 | 3.9% |
| ú | 40 | 2.9% |
| ó | 39 | 2.8% |
| Đ | 14 | 1.0% |
| Other values (4) | 4 | 0.3% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ờ | 41 | |
| ớ | 16 | 16.2% |
| ắ | 14 | 14.1% |
| ứ | 14 | 14.1% |
| ả | 13 | 13.1% |
| ậ | 1 | 1.0% |
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11501 |
| Missing (%) | 2.5% |
| Memory size | 3.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | LC |
|---|---|
| 2nd row | NE |
| 3rd row | LC |
| 4th row | LC |
| 5th row | LC |
| Value | Count | Frequency (%) |
| lc | 278407 | |
| ne | 139325 | |
| dd | 10088 | 2.3% |
| vu | 7110 | 1.6% |
| nt | 4625 | 1.0% |
| en | 2883 | 0.6% |
| cr | 1136 | 0.3% |
| ex | 119 | < 0.1% |
| ew | 18 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 279543 | |
| L | 278407 | |
| N | 146833 | |
| E | 142345 | |
| D | 20176 | 2.3% |
| V | 7110 | 0.8% |
| U | 7110 | 0.8% |
| T | 4625 | 0.5% |
| R | 1136 | 0.1% |
| X | 119 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 887422 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 279543 | |
| L | 278407 | |
| N | 146833 | |
| E | 142345 | |
| D | 20176 | 2.3% |
| V | 7110 | 0.8% |
| U | 7110 | 0.8% |
| T | 4625 | 0.5% |
| R | 1136 | 0.1% |
| X | 119 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 887422 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 279543 | |
| L | 278407 | |
| N | 146833 | |
| E | 142345 | |
| D | 20176 | 2.3% |
| V | 7110 | 0.8% |
| U | 7110 | 0.8% |
| T | 4625 | 0.5% |
| R | 1136 | 0.1% |
| X | 119 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 887422 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 279543 | |
| L | 278407 | |
| N | 146833 | |
| E | 142345 | |
| D | 20176 | 2.3% |
| V | 7110 | 0.8% |
| U | 7110 | 0.8% |
| T | 4625 | 0.5% |
| R | 1136 | 0.1% |
| X | 119 | < 0.1% |